1
|
Hansen P, Blau H, Hecht J, Karlebach G, Krannich A, Steinhaus R, Truss M, Robinson PN. Using paired-end read orientations to assess technical biases in capture Hi-C. NAR Genom Bioinform 2024; 6:lqae156. [PMID: 39660253 PMCID: PMC11630073 DOI: 10.1093/nargab/lqae156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 10/17/2024] [Accepted: 10/30/2024] [Indexed: 12/12/2024] Open
Abstract
Hi-C and capture Hi-C (CHi-C) both leverage paired-end sequencing of chimeric fragments to gauge the strength of interactions based on the total number of paired-end reads mapped to a common pair of restriction fragments. Mapped paired-end reads can have four relative orientations, depending on the genomic positions and strands of the two reads. We assigned one paired-end read orientation to each of the four possible re-ligations that can occur between two given restriction fragments. In a large hematopoietic cell dataset, we determined the read pair counts of interactions separately for each orientation. Interactions with imbalances in the counts occur much more often than expected by chance for both Hi-C and CHi-C. Based on such imbalances, we identified target restriction fragments enriched at only one instead of both ends. By matching them to the baits used for the experiments, we confirmed our assignment of paired-end read orientations and gained insights that can inform bait design. An analysis of unbaited fragments shows that, beyond bait effects, other known types of technical biases are reflected in count imbalances. Taking advantage of distance-dependent contact frequencies, we assessed the impact of such biases. Our results have the potential to improve the design and interpretation of CHi-C experiments.
Collapse
Affiliation(s)
- Peter Hansen
- The Robinson Lab, The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, 06032, Connecticut, USA
- The Robinson Lab, Berlin Institute of Health, Luisenstr. 65, 10117, Berlin, Germany
| | - Hannah Blau
- The Robinson Lab, The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, 06032, Connecticut, USA
| | - Jochen Hecht
- Genomics Unit, Centre for Genomic Regulation, Carrer del Dr. Aiguader 88, 08003, Barcelona, Spain
| | - Guy Karlebach
- The Robinson Lab, The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, 06032, Connecticut, USA
| | - Alexander Krannich
- Experimental and Clinical Research Center, Charité Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany
| | - Robin Steinhaus
- Exploratory Diagnostic Sciences, Berlin Institute of Health, Charitéplatz 1, 10117, Berlin, Germany
- Institute of Medical Genetics and Human Genetics, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt Universität zu Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
| | - Matthias Truss
- Labor für Pädiatrische Molekularbiologie, Charité Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
| | - Peter N Robinson
- The Robinson Lab, The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, 06032, Connecticut, USA
- The Robinson Lab, Berlin Institute of Health, Luisenstr. 65, 10117, Berlin, Germany
| |
Collapse
|
2
|
Pudjihartono M, Pudjihartono N, O'Sullivan JM, Schierding W. Melanoma-specific mutation hotspots in distal, non-coding, promoter-interacting regions implicate novel candidate driver genes. Br J Cancer 2024; 131:1644-1655. [PMID: 39367275 PMCID: PMC11555344 DOI: 10.1038/s41416-024-02870-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 09/23/2024] [Accepted: 09/26/2024] [Indexed: 10/06/2024] Open
Abstract
BACKGROUND To develop targeted treatments, it is crucial to identify the full spectrum of genetic drivers in melanoma, including those in non-coding regions. However, recent efforts to explore non-coding regions have primarily focused on gene-adjacent elements such as promoters and non-coding RNAs, leaving intergenic distal regulatory elements largely unexplored. METHODS We used Hi-C chromatin contact data from melanoma cells to map distal, non-coding, promoter-interacting regulatory elements genome-wide in melanoma. Using this "promoter-interaction network", alongside whole-genome sequence and gene expression data from the Pan Cancer Analysis of Whole Genomes, we developed multivariate linear regression models to identify distal somatic mutation hotspots that affect promoter activity. RESULTS We identified eight recurrently mutated hotspots that are novel, melanoma-specific, located in promoter-interacting distal regulatory elements, alter transcription factor binding motifs, and affect the expression of genes (e.g., HSPB7, CLDN1, ADCY9 and FDXR) previously implicated as tumour suppressors/oncogenes in various cancers. CONCLUSIONS Our study suggests additional non-coding drivers beyond the well-characterised TERT promoter in melanoma, offering new insights into the disruption of complex regulatory networks by non-coding mutations that may contribute to melanoma development. Furthermore, our study provides a framework for integrating multiple levels of biological data to uncover cancer-specific non-coding drivers.
Collapse
Affiliation(s)
- Michael Pudjihartono
- Liggins Institute, The University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
| | | | - Justin M O'Sullivan
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand.
- Australian Parkinson's Mission, Garvan Institute of Medical Research, Sydney, NSW, Australia.
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK.
- Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.
| | - William Schierding
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand.
- Department of Ophthalmology, University of Auckland, Auckland, New Zealand.
| |
Collapse
|
3
|
Rossi S, Richards EL, Orozco G, Eyre S. Functional Genomics in Psoriasis. Int J Mol Sci 2024; 25:7349. [PMID: 39000456 PMCID: PMC11242296 DOI: 10.3390/ijms25137349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 07/02/2024] [Accepted: 07/02/2024] [Indexed: 07/16/2024] Open
Abstract
Psoriasis is an autoimmune cutaneous condition that significantly impacts quality of life and represents a burden on society due to its prevalence. Genome-wide association studies (GWASs) have pinpointed several psoriasis-related risk loci, underlining the disease's complexity. Functional genomics is paramount to unveiling the role of such loci in psoriasis and disentangling its complex nature. In this review, we aim to elucidate the main findings in this field and integrate our discussion with gold-standard techniques in molecular biology-i.e., Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-and high-throughput technologies. These tools are vital to understanding how disease risk loci affect gene expression in psoriasis, which is crucial in identifying new targets for personalized treatments in advanced precision medicine.
Collapse
Affiliation(s)
| | | | | | - Stephen Eyre
- Centre for Genetics and Genomics versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK; (S.R.); (E.L.R.); (G.O.)
| |
Collapse
|
4
|
Yang J, Zhu X, Wang R, Li M, Tang Q. Revisiting Assessment of Computational Methods for Hi-C Data Analysis. Int J Mol Sci 2023; 24:13814. [PMID: 37762117 PMCID: PMC10531246 DOI: 10.3390/ijms241813814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/30/2023] [Accepted: 09/03/2023] [Indexed: 09/29/2023] Open
Abstract
The performances of algorithms for Hi-C data preprocessing, the identification of topologically associating domains, and the detection of chromatin interactions and promoter-enhancer interactions have been mostly evaluated using semi-quantitative or synthetic data approaches, without utilizing the most recent methods, since 2017. In this study, we comprehensively evaluated 24 popular state-of-the-art methods for the complete end-to-end pipeline of Hi-C data analysis, using manually curated or experimentally validated benchmark datasets, including a CRISPR dataset for promoter-enhancer interaction validation. Our results indicate that, although no single method exhibited superior performance in all situations, HiC-Pro, DomainCaller, and Fit-Hi-C2 showed relatively balanced performances of most evaluation metrics for preprocessing, topologically associating domain identification, and chromatin interaction/promoter-enhancer interaction detection, respectively. The comprehensive comparison presented in this manuscript provides a reference for researchers to choose Hi-C analysis tools that best suit their needs.
Collapse
Affiliation(s)
- Jing Yang
- Livestock and Poultry Multi-Omics Key Laboratory of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China; (J.Y.); (X.Z.); (R.W.)
- Animal Breeding and Genetics Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu 610066, China
| | - Xingxing Zhu
- Livestock and Poultry Multi-Omics Key Laboratory of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China; (J.Y.); (X.Z.); (R.W.)
- Animal Breeding and Genetics Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu 610066, China
| | - Rui Wang
- Livestock and Poultry Multi-Omics Key Laboratory of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China; (J.Y.); (X.Z.); (R.W.)
- Animal Breeding and Genetics Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu 610066, China
| | - Mingzhou Li
- Livestock and Poultry Multi-Omics Key Laboratory of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China; (J.Y.); (X.Z.); (R.W.)
- Animal Breeding and Genetics Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu 610066, China
| | - Qianzi Tang
- Livestock and Poultry Multi-Omics Key Laboratory of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China; (J.Y.); (X.Z.); (R.W.)
- Animal Breeding and Genetics Key Laboratory of Sichuan Province, Sichuan Animal Science Academy, Chengdu 610066, China
| |
Collapse
|
5
|
Orozco G, Schoenfelder S, Walker N, Eyre S, Fraser P. 3D genome organization links non-coding disease-associated variants to genes. Front Cell Dev Biol 2022; 10:995388. [PMID: 36340032 PMCID: PMC9631826 DOI: 10.3389/fcell.2022.995388] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 09/27/2022] [Indexed: 11/13/2022] Open
Abstract
Genome sequencing has revealed over 300 million genetic variations in human populations. Over 90% of variants are single nucleotide polymorphisms (SNPs), the remainder include short deletions or insertions, and small numbers of structural variants. Hundreds of thousands of these variants have been associated with specific phenotypic traits and diseases through genome wide association studies which link significant differences in variant frequencies with specific phenotypes among large groups of individuals. Only 5% of disease-associated SNPs are located in gene coding sequences, with the potential to disrupt gene expression or alter of the function of encoded proteins. The remaining 95% of disease-associated SNPs are located in non-coding DNA sequences which make up 98% of the genome. The role of non-coding, disease-associated SNPs, many of which are located at considerable distances from any gene, was at first a mystery until the discovery that gene promoters regularly interact with distal regulatory elements to control gene expression. Disease-associated SNPs are enriched at the millions of gene regulatory elements that are dispersed throughout the non-coding sequences of the genome, suggesting they function as gene regulation variants. Assigning specific regulatory elements to the genes they control is not straightforward since they can be millions of base pairs apart. In this review we describe how understanding 3D genome organization can identify specific interactions between gene promoters and distal regulatory elements and how 3D genomics can link disease-associated SNPs to their target genes. Understanding which gene or genes contribute to a specific disease is the first step in designing rational therapeutic interventions.
Collapse
Affiliation(s)
- Gisela Orozco
- Centre for Genetics and Genomics Versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
- NIHR Manchester Biomedical Research Centre, Manchester University Foundation Trust, Manchester, United Kingdom
| | - Stefan Schoenfelder
- Enhanc3D Genomics Ltd., Cambridge, United Kingdom
- Epigenetics Programme, The Babraham Institute, Babraham Research Campus, CB22 3AT Cambridge, Cambridge, United Kingdom
| | | | - Stephan Eyre
- Centre for Genetics and Genomics Versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
- NIHR Manchester Biomedical Research Centre, Manchester University Foundation Trust, Manchester, United Kingdom
| | - Peter Fraser
- Enhanc3D Genomics Ltd., Cambridge, United Kingdom
- Department of Biological Science, Florida State University, Tallahassee, FL, United States
| |
Collapse
|