1
|
Xu Y, Das P, McCord RP, Shen T. Node features of chromosome structure networks and their connections to genome annotation. Comput Struct Biotechnol J 2024; 23:2240-2250. [PMID: 38827231 PMCID: PMC11140560 DOI: 10.1016/j.csbj.2024.05.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/04/2024] Open
Abstract
The 3D conformations of chromosomes can encode biological significance, and the implications of such structures have been increasingly appreciated recently. Certain chromosome structural features, such as A/B compartmentalization, are frequently extracted from Hi-C pairwise genome contact information (physical association between different regions of the genome) and compared with linear annotations of the genome, such as histone modifications and lamina association. We investigate how additional properties of chromosome structure can be deduced using an abstract graph representation of the contact heatmap, and describe specific network properties that can have a strong connection with some of these biological annotations. We constructed chromosome structure networks (CSNs) from bulk Hi-C data and calculated a set of site-resolved (node-based) network properties. These properties are useful for characterizing certain aspects of chromosomal structure. We examined the ability of network properties to differentiate several scenarios, such as haploid vs diploid cells, partially inverted nuclei vs conventional architecture, depletion of chromosome architectural proteins, and structural changes during cell development. We also examined the connection between network properties and a series of other linear annotations, such as histone modifications and chromatin states including poised promoter and enhancer labels. We found that semi-local network properties exhibit greater capability in characterizing genome annotations compared to diffusive or ultra-local node features. For example, the local square clustering coefficient can be a strong classifier of lamina-associated domains. We demonstrated that network properties can be useful for highlighting large-scale chromosome structure differences that emerge in different biological situations.
Collapse
Affiliation(s)
- Yingjie Xu
- Graduate School of Genome Science & Technology, University of Tennessee, Knoxville, TN 37996, USA
| | - Priyojit Das
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Rachel Patton McCord
- Department of Biochemistry & Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Tongye Shen
- Department of Biochemistry & Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| |
Collapse
|
2
|
Zitnik M, Li MM, Wells A, Glass K, Morselli Gysi D, Krishnan A, Murali TM, Radivojac P, Roy S, Baudot A, Bozdag S, Chen DZ, Cowen L, Devkota K, Gitter A, Gosline SJC, Gu P, Guzzi PH, Huang H, Jiang M, Kesimoglu ZN, Koyuturk M, Ma J, Pico AR, Pržulj N, Przytycka TM, Raphael BJ, Ritz A, Sharan R, Shen Y, Singh M, Slonim DK, Tong H, Yang XH, Yoon BJ, Yu H, Milenković T. Current and future directions in network biology. BIOINFORMATICS ADVANCES 2024; 4:vbae099. [PMID: 39143982 PMCID: PMC11321866 DOI: 10.1093/bioadv/vbae099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 05/31/2024] [Accepted: 07/08/2024] [Indexed: 08/16/2024]
Abstract
Summary Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These stem from various factors, notably the growing complexity and volume of data together with the increased diversity of data types describing different tiers of biological organization. We discuss prevailing research directions in network biology, focusing on molecular/cellular networks but also on other biological network types such as biomedical knowledge graphs, patient similarity networks, brain networks, and social/contact networks relevant to disease spread. In more detail, we highlight areas of inference and comparison of biological networks, multimodal data integration and heterogeneous networks, higher-order network analysis, machine learning on networks, and network-based personalized medicine. Following the overview of recent breakthroughs across these five areas, we offer a perspective on future directions of network biology. Additionally, we discuss scientific communities, educational initiatives, and the importance of fostering diversity within the field. This article establishes a roadmap for an immediate and long-term vision for network biology. Availability and implementation Not applicable.
Collapse
Affiliation(s)
- Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
| | - Michelle M Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
| | - Aydin Wells
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
- Lucy Family Institute for Data and Society, University of Notre Dame, Notre Dame, IN 46556, United States
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, United States
| | - Deisy Morselli Gysi
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, United States
- Department of Statistics, Federal University of Paraná, Curitiba, Paraná 81530-015, Brazil
- Department of Physics, Northeastern University, Boston, MA 02115, United States
| | - Arjun Krishnan
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, United States
| | - T M Murali
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States
| | - Predrag Radivojac
- Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, United States
| | - Sushmita Roy
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53715, United States
- Wisconsin Institute for Discovery, Madison, WI 53715, United States
| | - Anaïs Baudot
- Aix Marseille Université, INSERM, MMG, Marseille, France
| | - Serdar Bozdag
- Department of Computer Science and Engineering, University of North Texas, Denton, TX 76203, United States
- Department of Mathematics, University of North Texas, Denton, TX 76203, United States
| | - Danny Z Chen
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Lenore Cowen
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Kapil Devkota
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53715, United States
- Morgridge Institute for Research, Madison, WI 53715, United States
| | - Sara J C Gosline
- Biological Sciences Division, Pacific Northwest National Laboratory, Seattle, WA 98109, United States
| | - Pengfei Gu
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Pietro H Guzzi
- Department of Medical and Surgical Sciences, University Magna Graecia of Catanzaro, Catanzaro, 88100, Italy
| | - Heng Huang
- Department of Computer Science, University of Maryland College Park, College Park, MD 20742, United States
| | - Meng Jiang
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
| | - Ziynet Nesibe Kesimoglu
- Department of Computer Science and Engineering, University of North Texas, Denton, TX 76203, United States
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, United States
| | - Mehmet Koyuturk
- Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, United States
| | - Alexander R Pico
- Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, United States
| | - Nataša Pržulj
- Department of Computer Science, University College London, London, WC1E 6BT, England
- ICREA, Catalan Institution for Research and Advanced Studies, Barcelona, 08010, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, 08034, Spain
| | - Teresa M Przytycka
- National Center of Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, United States
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, Princeton, NJ 08544, United States
| | - Anna Ritz
- Department of Biology, Reed College, Portland, OR 97202, United States
| | - Roded Sharan
- School of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Yang Shen
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, United States
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, NJ 08544, United States
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, United States
| | - Donna K Slonim
- Department of Computer Science, Tufts University, Medford, MA 02155, United States
| | - Hanghang Tong
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, United States
| | - Xinan Holly Yang
- Department of Pediatrics, University of Chicago, Chicago, IL 60637, United States
| | - Byung-Jun Yoon
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, United States
- Computational Science Initiative, Brookhaven National Laboratory, Upton, NY 11973, United States
| | - Haiyuan Yu
- Department of Computational Biology, Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, United States
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, United States
- Lucy Family Institute for Data and Society, University of Notre Dame, Notre Dame, IN 46556, United States
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, United States
| |
Collapse
|
3
|
Parker SM, Davis ES, Phanstiel DH. Guiding the design of well-powered Hi-C experiments to detect differential loops. BIOINFORMATICS ADVANCES 2023; 3:vbad152. [PMID: 38023330 PMCID: PMC10645293 DOI: 10.1093/bioadv/vbad152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/28/2023] [Accepted: 10/14/2023] [Indexed: 12/01/2023]
Abstract
Motivation Three-dimensional chromatin structure plays an important role in gene regulation by connecting regulatory regions and gene promoters. The ability to detect the formation and loss of these loops in various cell types and conditions provides valuable information on the mechanisms driving these cell states and is critical for understanding long-range gene regulation. Hi-C is a powerful technique for characterizing 3D chromatin structure; however, Hi-C can quickly become costly and labor-intensive, and proper planning is required to ensure efficient use of time and resources while maintaining experimental rigor and well-powered results. Results To facilitate better planning and interpretation of human Hi-C experiments, we conducted a detailed evaluation of statistical power using publicly available Hi-C datasets, paying particular attention to the impact of loop size on Hi-C contacts and fold change compression. In addition, we have developed Hi-C Poweraid, a publicly hosted web application to investigate these findings. For experiments involving well-replicated cell lines, we recommend a total sequencing depth of at least 6 billion contacts per condition, split between at least two replicates to achieve the power to detect differences in the majority of loops. For experiments with higher variation, more replicates and deeper sequencing depths are required. Values for specific cases can be determined by using Hi-C Poweraid. This tool simplifies Hi-C power calculations, allowing for more efficient use of time and resources and more accurate interpretation of experimental results. Availability and implementation Hi-C Poweraid is available as an R Shiny application deployed at http://phanstiel-lab.med.unc.edu/poweraid/, with code available at https://github.com/sarmapar/poweraid.
Collapse
Affiliation(s)
- Sarah M Parker
- Curriculum in Bioinformatics and Computational Biology, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| | - Eric S Davis
- Curriculum in Bioinformatics and Computational Biology, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| | - Douglas H Phanstiel
- Curriculum in Bioinformatics and Computational Biology, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
- Curriculum in Genetics and Molecular Biology, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
- Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC, 27599, United States
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill, NC, 27599, United States
- Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, United States
| |
Collapse
|
4
|
Zhang Y, Zhang J, Zhang W, Wang M, Wang S, Xu Y, Zhao L, Li X, Li G. Mapping Multi-factor-mediated Chromatin Interactions to Assess Dysregulation of Lung Cancer-related Genes. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:573-588. [PMID: 36702236 PMCID: PMC10787015 DOI: 10.1016/j.gpb.2023.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Revised: 11/30/2022] [Accepted: 01/17/2023] [Indexed: 01/25/2023]
Abstract
Studies on the lung cancer genome are indispensable for developing a cure for lung cancer. Whole-genome resequencing, genome-wide association studies, and transcriptome sequencing have greatly improved our understanding of the cancer genome. However, dysregulation of long-range chromatin interactions in lung cancer remains poorly described. To better understand the three-dimensional (3D) genomic interaction features of the lung cancer genome, we used the A549 cell line as a model system and generated high-resolution chromatin interactions associated with RNA polymerase II (RNAPII), CCCTC-binding factor (CTCF), enhancer of zeste homolog 2 (EZH2), and histone 3 lysine 27 trimethylation (H3K27me3) using long-read chromatin interaction analysis by paired-end tag sequencing (ChIA-PET). Analysis showed that EZH2/H3K27me3-mediated interactions further repressed target genes, either through loops or domains, and their distributions along the genome were distinct from and complementary to those associated with RNAPII. Cancer-related genes were highly enriched with chromatin interactions, and chromatin interactions specific to the A549 cell line were associated with oncogenes and tumor suppressor genes, such as additional repressive interactions on FOXO4 and promoter-promoter interactions between NF1 and RNF135. Knockout of an anchor associated with chromatin interactions reversed the dysregulation of cancer-related genes, suggesting that chromatin interactions are essential for proper expression of lung cancer-related genes. These findings demonstrate the 3D landscape and gene regulatory relationships of the lung cancer genome.
Collapse
Affiliation(s)
- Yan Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China; Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China
| | - Jingwen Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China; Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China
| | - Wei Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Mohan Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China; Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China
| | - Shuangqi Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China; Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China
| | - Yao Xu
- Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China
| | - Lun Zhao
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Xingwang Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Guoliang Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China; Hubei Key Laboratory of Agricultural Bioinformatics and Hubei Engineering Technology Research Center of Agricultural Big Data, 3D Genomics Research Center, Huazhong Agricultural University, Wuhan 430070, China.
| |
Collapse
|
5
|
Pancaldi V. Network models of chromatin structure. Curr Opin Genet Dev 2023; 80:102051. [PMID: 37245241 DOI: 10.1016/j.gde.2023.102051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 04/13/2023] [Accepted: 04/14/2023] [Indexed: 05/30/2023]
Abstract
Increasing numbers of datasets and experimental assays that capture the organization of chromatin inside the nucleus warrant an effort to develop tools to visualize and analyze these structures. Alongside polymer physics or constraint-based modeling, network theory approaches to describe 3D epigenome organization have gained in popularity. Representing genomic regions as nodes in a network enables visualization of 1D epigenomics datasets in the context of chromatin structure maps, while network theory metrics can be used to describe 3D epigenome organization and dynamics. In this review, we summarize the most salient applications of network theory to the study of chromatin contact maps, demonstrating its potential in revealing epigenomic patterns and relating them to cellular phenotypes.
Collapse
Affiliation(s)
- Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France.
| |
Collapse
|
6
|
Parker SM, Davis ES, Phanstiel DH. Guiding the design of well-powered Hi-C experiments to detect differential loops. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.15.532762. [PMID: 36993250 PMCID: PMC10055049 DOI: 10.1101/2023.03.15.532762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
3D chromatin structure plays an important role in gene regulation by connecting regulatory regions and gene promoters. The ability to detect the formation and loss of these loops in various cell types and conditions provides valuable information on the mechanisms driving these cell states and is critical for understanding how long-range gene regulation works. Hi-C is a powerful technique used to characterize three-dimensional chromatin structure; however, Hi-C can quickly become a costly and labor-intensive endeavor, and proper planning is required to determine how to best use time and resources while maintaining experimental rigor and well-powered results. To facilitate better planning and interpretation of Hi-C experiments, we have conducted a detailed evaluation of statistical power using publicly available Hi-C datasets paying particular attention to the impact of loop size on Hi-C contacts and fold change compression. In addition, we have developed Hi-C Poweraid, a publicly-hosted web application to investigate these findings (http://phanstiel-lab.med.unc.edu/poweraid/). For experiments involving well-replicated cell lines, we recommend a total sequencing depth of at least 6 billion contacts per condition, split between at least 2 replicates in order to achieve the power to detect the majority of differential loops. For experiments with higher variation, more replicates and deeper sequencing depths are required. Exact values and recommendations for specific cases can be determined through the use of Hi-C Poweraid. This tool simplifies the complexities behind calculating power for Hi-C data and will provide useful information on the amount of well-powered loops an experiment will be able to detect given a specific set of experimental parameters, such as sequencing depth, replicates, and the sizes of the loops of interest. This will allow for more efficient use of time and resources and more accurate interpretation of experimental results.
Collapse
Affiliation(s)
- Sarah M Parker
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, NC, USA
| | - Eric S Davis
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, NC, USA
| | - Douglas H Phanstiel
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, NC, USA
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, NC, USA
- Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC, USA
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill, NC, USA
- Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
7
|
Mora A, Huang X, Jauhari S, Jiang Q, Li X. Chromatin Hubs: A biological and computational outlook. Comput Struct Biotechnol J 2022; 20:3796-3813. [PMID: 35891791 PMCID: PMC9304431 DOI: 10.1016/j.csbj.2022.07.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 07/02/2022] [Accepted: 07/02/2022] [Indexed: 11/20/2022] Open
Abstract
This review discusses our current understanding of chromatin biology and bioinformatics under the unifying concept of “chromatin hubs.” The first part reviews the biology of chromatin hubs, including chromatin–chromatin interaction hubs, chromatin hubs at the nuclear periphery, hubs around macromolecules such as RNA polymerase or lncRNAs, and hubs around nuclear bodies such as the nucleolus or nuclear speckles. The second part reviews existing computational methods, including enhancer–promoter interaction prediction, network analysis, chromatin domain callers, transcription factory predictors, and multi-way interaction analysis. We introduce an integrated model that makes sense of the existing evidence. Understanding chromatin hubs may allow us (i) to explain long-unsolved biological questions such as interaction specificity and redundancy of mechanisms, (ii) to develop more realistic kinetic and functional predictions, and (iii) to explain the etiology of genomic disease.
Collapse
Affiliation(s)
- Antonio Mora
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences), Guangzhou 511436, PR China
- Corresponding authors.
| | - Xiaowei Huang
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences), Guangzhou 511436, PR China
| | - Shaurya Jauhari
- Joint School of Life Sciences, Guangzhou Medical University and Guangzhou Institutes of Biomedicine and Health (Chinese Academy of Sciences), Guangzhou 511436, PR China
| | - Qin Jiang
- Affiliated Eye Hospital of Nanjing Medical University, Nanjing 210000, PR China
| | - Xuri Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, and Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou 510060, PR China
- Corresponding authors.
| |
Collapse
|
8
|
Jablonski KP, Carron L, Mozziconacci J, Forné T, Hütt MT, Lesne A. Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study. Hum Genomics 2022; 16:2. [PMID: 35016721 PMCID: PMC8753905 DOI: 10.1186/s40246-022-00375-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 01/02/2022] [Indexed: 01/31/2023] Open
Abstract
Background Genome-wide association studies have identified statistical associations between various diseases, including cancers, and a large number of single-nucleotide polymorphisms (SNPs). However, they provide no direct explanation of the mechanisms underlying the association. Based on the recent discovery that changes in three-dimensional genome organization may have functional consequences on gene regulation favoring diseases, we investigated systematically the genome-wide distribution of disease-associated SNPs with respect to a specific feature of 3D genome organization: topologically associating domains (TADs) and their borders. Results For each of 449 diseases, we tested whether the associated SNPs are present in TAD borders more often than observed by chance, where chance (i.e., the null model in statistical terms) corresponds to the same number of pointwise loci drawn at random either in the entire genome, or in the entire set of disease-associated SNPs listed in the GWAS catalog. Our analysis shows that a fraction of diseases displays such a preferential localization of their risk loci. Moreover, cancers are relatively more frequent among these diseases, and this predominance is generally enhanced when considering only intergenic SNPs. The structure of SNP-based diseasome networks confirms that localization of risk loci in TAD borders differs between cancers and non-cancer diseases. Furthermore, different TAD border enrichments are observed in embryonic stem cells and differentiated cells, consistent with changes in topological domains along embryogenesis and delineating their contribution to disease risk. Conclusions Our results suggest that, for certain diseases, part of the genetic risk lies in a local genetic variation affecting the genome partitioning in topologically insulated domains. Investigating this possible contribution to genetic risk is particularly relevant in cancers. This study thus opens a way of interpreting genome-wide association studies, by distinguishing two types of disease-associated SNPs: one with an effect on an individual gene, the other acting in interplay with 3D genome organization. Supplementary Information The online version contains supplementary material available at 10.1186/s40246-022-00375-2.
Collapse
Affiliation(s)
- Kim Philipp Jablonski
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland
| | - Leopold Carron
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France.,Laboratory of Computational and Quantitative Biology, LCQB, Sorbonne Université, Paris, France
| | - Julien Mozziconacci
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France.,Structure et Instabilité des Génomes, Muséum National d'Histoire Naturelle, Paris, France
| | - Thierry Forné
- Institut de Génétique Moléculaire de Montpellier, IGMM, CNRS, Univ. Montpellier, Montpellier, France
| | - Marc-Thorsten Hütt
- Department of Life Sciences and Chemistry, Jacobs University Bremen, Bremen, Germany.
| | - Annick Lesne
- Laboratoire de Physique Théorique de la Matière Condensée, LPTMC, CNRS, Sorbonne Université, Paris, France. .,Institut de Génétique Moléculaire de Montpellier, IGMM, CNRS, Univ. Montpellier, Montpellier, France.
| |
Collapse
|
9
|
Pancaldi V. Chromatin Network Analyses: Towards Structure-Function Relationships in Epigenomics. FRONTIERS IN BIOINFORMATICS 2021; 1:742216. [PMID: 36303769 PMCID: PMC9581029 DOI: 10.3389/fbinf.2021.742216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 10/04/2021] [Indexed: 01/16/2023] Open
Abstract
Recent technological advances have allowed us to map chromatin conformation and uncover the genome's spatial organization of the genome inside the nucleus. These experiments have revealed the complexities of genome folding, characterized by the presence of loops and domains at different scales, which can change across development and in different cell types. There is strong evidence for a relationship between the topological properties of chromatin contacts and cellular phenotype. Chromatin can be represented as a network, in which genomic fragments are the nodes and connections represent experimentally observed spatial proximity of two genomically distant regions in a specific cell type or biological condition. With this approach we can consider a variety of chromatin features in association with the 3D structure, investigating how nuclear chromatin organization can be related to gene regulation, replication, malignancy, phenotypic variability and plasticity. We briefly review the results obtained on genome architecture through network theoretic approaches. As previously observed in protein-protein interaction networks and many types of non-biological networks, external conditions could shape network topology through a yet unidentified structure-function relationship. Similar to scientists studying the brain, we are confronted with a duality between a spatially embedded network of physical contacts, a related network of correlation in the dynamics of network nodes and, finally, an abstract definition of function of this network, related to phenotype. We summarise major developments in the study of networks in other fields, which we think can suggest a path towards better understanding how 3D genome configuration can impact biological function and adaptation to the environment.
Collapse
Affiliation(s)
- Vera Pancaldi
- Centre de Recherches en Cancérologie de Toulouse (CRCT), Institut National de la Santé et de la Recherche Médicale (Inserm) U1037, Centre National de la Recherche Scientifique (CNRS) U5071, Université Paul Sabatier, Toulouse, France
- Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|