1
|
Jalili V, Cremona MA, Palluzzi F. Rescuing biologically relevant consensus regions across replicated samples. BMC Bioinformatics 2023; 24:240. [PMID: 37286963 PMCID: PMC10246347 DOI: 10.1186/s12859-023-05340-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 05/16/2023] [Indexed: 06/09/2023] Open
Abstract
BACKGROUND Protein-DNA binding sites of ChIP-seq experiments are identified where the binding affinity is significant based on a given threshold. The choice of the threshold is a trade-off between conservative region identification and discarding weak, but true binding sites. RESULTS We rescue weak binding sites using MSPC, which efficiently exploits replicates to lower the threshold required to identify a site while keeping a low false-positive rate, and we compare it to IDR, a widely used post-processing method for identifying highly reproducible peaks across replicates. We observe several master transcription regulators (e.g., SP1 and GATA3) and HDAC2-GATA1 regulatory networks on rescued regions in K562 cell line. CONCLUSIONS We argue the biological relevance of weak binding sites and the information they add when rescued by MSPC. An implementation of the proposed extended MSPC methodology and the scripts to reproduce the performed analysis are freely available at https://genometric.github.io/MSPC/ ; MSPC is distributed as a command-line application and an R package available from Bioconductor ( https://doi.org/doi:10.18129/B9.bioc.rmspc ).
Collapse
Affiliation(s)
- Vahid Jalili
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Marzia A Cremona
- Department of Operations and Decision Systems, Université Laval, Quebec, Canada.
- CHU de Québec - Université Laval Research Center, Quebec, Canada.
| | - Fernando Palluzzi
- Department of Brain and Behavioral Sciences, Università di Pavia, Pavia, Italy.
| |
Collapse
|
2
|
Kumar A, Hu MY, Mei Y, Fan Y. CSSQ: a ChIP-seq signal quantifier pipeline. Front Cell Dev Biol 2023; 11:1167111. [PMID: 37305684 PMCID: PMC10248417 DOI: 10.3389/fcell.2023.1167111] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 05/02/2023] [Indexed: 06/13/2023] Open
Abstract
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) has revolutionized the studies of epigenomes and the massive increase in ChIP-seq datasets calls for robust and user-friendly computational tools for quantitative ChIP-seq. Quantitative ChIP-seq comparisons have been challenging due to noisiness and variations inherent to ChIP-seq and epigenomes. By employing innovative statistical approaches specially catered to ChIP-seq data distribution and sophisticated simulations along with extensive benchmarking studies, we developed and validated CSSQ as a nimble statistical analysis pipeline capable of differential binding analysis across ChIP-seq datasets with high confidence and sensitivity and low false discovery rate with any defined regions. CSSQ models ChIP-seq data as a finite mixture of Gaussians faithfully that reflects ChIP-seq data distribution. By a combination of Anscombe transformation, k-means clustering, estimated maximum normalization, CSSQ minimizes noise and bias from experimental variations. Further, CSSQ utilizes a non-parametric approach and incorporates comparisons under the null hypothesis by unaudited column permutation to perform robust statistical tests to account for fewer replicates of ChIP-seq datasets. In sum, we present CSSQ as a powerful statistical computational pipeline tailored for ChIP-seq data quantitation and a timely addition to the tool kits of differential binding analysis to decipher epigenomes.
Collapse
Affiliation(s)
- Ashwath Kumar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States
| | - Michael Y. Hu
- Department of Computer Science, Princeton University, Princeton, NJ, United States
| | - Yajun Mei
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, United States
- Parker H. Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA, United States
| | - Yuhong Fan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, United States
- Parker H. Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA, United States
| |
Collapse
|
3
|
Chen H, Tu S, Yuan C, Tian F, Zhang Y, Sun Y, Shao Z. HyperChIP: identification of hypervariable signals across ChIP-seq or ATAC-seq samples. Genome Biol 2022; 23:62. [PMID: 35227282 PMCID: PMC8883642 DOI: 10.1186/s13059-022-02627-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 02/07/2022] [Indexed: 12/13/2022] Open
Abstract
Identifying genomic regions with hypervariable ChIP-seq or ATAC-seq signals across given samples is essential for large-scale epigenetic studies. In particular, the hypervariable regions across tumors from different patients indicate their heterogeneity and can contribute to revealing potential cancer subtypes and the associated epigenetic markers. We present HyperChIP as the first complete statistical tool for the task. HyperChIP uses scaled variances that account for the mean-variance dependence to rank genomic regions, and it increases the statistical power by diminishing the influence of true hypervariable regions on model fitting. A pan-cancer case study illustrates the practical utility of HyperChIP.
Collapse
|
4
|
Faux T, Rytkönen KT, Mahmoudian M, Paulin N, Junttila S, Laiho A, Elo LL. Differential ATAC-seq and ChIP-seq peak detection using ROTS. NAR Genom Bioinform 2021; 3:lqab059. [PMID: 34235431 PMCID: PMC8253552 DOI: 10.1093/nargab/lqab059] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 05/12/2021] [Accepted: 06/11/2021] [Indexed: 12/30/2022] Open
Abstract
Changes in cellular chromatin states fine-tune transcriptional output and ultimately lead to phenotypic changes. Here we propose a novel application of our reproducibility-optimized test statistics (ROTS) to detect differential chromatin states (ATAC-seq) or differential chromatin modification states (ChIP-seq) between conditions. We compare the performance of ROTS to existing and widely used methods for ATAC-seq and ChIP-seq data using both synthetic and real datasets. Our results show that ROTS outperformed other commonly used methods when analyzing ATAC-seq data. ROTS also displayed the most accurate detection of small differences when modeling with synthetic data. We observed that two-step methods that require the use of a separate peak caller often more accurately called enrichment borders, whereas one-step methods without a separate peak calling step were more versatile in calling sub-peaks. The top ranked differential regions detected by the methods had marked correlation with transcriptional differences of the closest genes. Overall, our study provides evidence that ROTS is a useful addition to the available differential peak detection methods to study chromatin and performs especially well when applied to study differential chromatin states in ATAC-seq data.
Collapse
Affiliation(s)
- Thomas Faux
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
| | - Kalle T Rytkönen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
- Institute of Biomedicine, University of Turku, Kiinamyllynkatu 10, 20014, Finland
| | - Mehrad Mahmoudian
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
- Department of Future Technologies, University of Turku, FI-20014 Turku, Finland
| | - Niklas Paulin
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
| | - Sini Junttila
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
| | - Asta Laiho
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
- Institute of Biomedicine, University of Turku, Kiinamyllynkatu 10, 20014, Finland
| |
Collapse
|
5
|
Taguchi YH, Turki T. Unsupervised tensor decomposition-based method to extract candidate transcription factors as histone modification bookmarks in post-mitotic transcriptional reactivation. PLoS One 2021; 16:e0251032. [PMID: 34032804 PMCID: PMC8148352 DOI: 10.1371/journal.pone.0251032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/17/2021] [Indexed: 11/25/2022] Open
Abstract
The histone group added to a gene sequence must be removed during mitosis to halt transcription during the DNA replication stage of the cell cycle. However, the detailed mechanism of this transcription regulation remains unclear. In particular, it is not realistic to reconstruct all appropriate histone modifications throughout the genome from scratch after mitosis. Thus, it is reasonable to assume that there might be a type of "bookmark" that retains the positions of histone modifications, which can be readily restored after mitosis. We developed a novel computational approach comprising tensor decomposition (TD)-based unsupervised feature extraction (FE) to identify transcription factors (TFs) that bind to genes associated with reactivated histone modifications as candidate histone bookmarks. To the best of our knowledge, this is the first application of TD-based unsupervised FE to the cell division context and phases pertaining to the cell cycle in general. The candidate TFs identified with this approach were functionally related to cell division, suggesting the suitability of this method and the potential of the identified TFs as bookmarks for histone modification during mitosis.
Collapse
Affiliation(s)
- Y-h. Taguchi
- Department of Physics, Chuo University, Tokyo, Japan
| | - Turki Turki
- Department of Computer Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
6
|
Tanaka A, Ishitsuka Y, Ohta H, Fujimoto A, Yasunaga JI, Matsuoka M. Systematic clustering algorithm for chromatin accessibility data and its application to hematopoietic cells. PLoS Comput Biol 2020; 16:e1008422. [PMID: 33253153 PMCID: PMC7728210 DOI: 10.1371/journal.pcbi.1008422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 12/10/2020] [Accepted: 10/06/2020] [Indexed: 11/18/2022] Open
Abstract
The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we give a clustering algorithm for genome-wide open chromatin data using a new data reduction method. This method regards the genome as a string of 1s and 0s based on a set of peaks and calculates the Hamming distances between the strings. This algorithm with the systematically optimized set of peaks enables us to quantitatively evaluate differences between samples of hematopoietic cells and classify cell types, potentially leading to a better understanding of leukemia pathogenesis.
Collapse
Affiliation(s)
- Azusa Tanaka
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Laboratory of Virus Control, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Japan
- * E-mail: (AT); (YI); (HO)
| | - Yasuhiro Ishitsuka
- Center for Science Adventure and Collaborative Research Advancement, Graduate School of Science, Kyoto University, Kyoto, Japan
- Department of Mathematics, Graduate School of Science, Kyoto University, Kyoto, Japan
- * E-mail: (AT); (YI); (HO)
| | - Hiroki Ohta
- Center for Science Adventure and Collaborative Research Advancement, Graduate School of Science, Kyoto University, Kyoto, Japan
- Department of Physics, Graduate School of Science, Kyoto University, Kyoto, Japan
- * E-mail: (AT); (YI); (HO)
| | - Akihiro Fujimoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Jun-ichirou Yasunaga
- Laboratory of Virus Control, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Japan
- Department of Hematology, Rheumatology and Infectious Disease, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Masao Matsuoka
- Laboratory of Virus Control, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Japan
- Department of Hematology, Rheumatology and Infectious Disease, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| |
Collapse
|
7
|
Tu S, Li M, Chen H, Tan F, Xu J, Waxman DJ, Zhang Y, Shao Z. MAnorm2 for quantitatively comparing groups of ChIP-seq samples. Genome Res 2020; 31:131-145. [PMID: 33208455 PMCID: PMC7849384 DOI: 10.1101/gr.262675.120] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 11/09/2020] [Indexed: 12/16/2022]
Abstract
Eukaryotic gene transcription is regulated by a large cohort of chromatin-associated proteins, and inferring their differential binding sites between cellular contexts requires a rigorous comparison of the corresponding ChIP-seq data. We present MAnorm2, a new computational tool for quantitatively comparing groups of ChIP-seq samples. MAnorm2 uses a hierarchical strategy for normalization of ChIP-seq data and assesses within-group variability of ChIP-seq signals based on an empirical Bayes framework. In this framework, MAnorm2 allows for abundant differential ChIP-seq signals between groups of samples as well as very different global within-group variability between groups. Using a number of real ChIP-seq data sets, we observed that MAnorm2 clearly outperformed existing tools for differential ChIP-seq analysis, especially when the groups of samples being compared had distinct global within-group variability.
Collapse
Affiliation(s)
- Shiqi Tu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mushan Li
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Haojie Chen
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Fengxiang Tan
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jian Xu
- Children's Medical Center Research Institute, Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
| | - David J Waxman
- Department of Biology and Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA
| | - Yijing Zhang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China
| | - Zhen Shao
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| |
Collapse
|
8
|
Höllbacher B, Balázs K, Heinig M, Uhlenhaut NH. Seq-ing answers: Current data integration approaches to uncover mechanisms of transcriptional regulation. Comput Struct Biotechnol J 2020; 18:1330-1341. [PMID: 32612756 PMCID: PMC7306512 DOI: 10.1016/j.csbj.2020.05.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 05/21/2020] [Accepted: 05/23/2020] [Indexed: 02/06/2023] Open
Abstract
Advancements in the field of next generation sequencing lead to the generation of ever-more data, with the challenge often being how to combine and reconcile results from different OMICs studies such as genome, epigenome and transcriptome. Here we provide an overview of the standard processing pipelines for ChIP-seq and RNA-seq as well as common downstream analyses. We describe popular multi-omics data integration approaches used to identify target genes and co-factors, and we discuss how machine learning techniques may predict transcriptional regulators and gene expression.
Collapse
Affiliation(s)
- Barbara Höllbacher
- Institute for Diabetes and Cancer IDC, Helmholtz Zentrum Muenchen (HMGU) and German Center for Diabetes Research (DZD), Munich 85764, Neuherberg, Germany.,Institute of Computational Biology ICB, Helmholtz Zentrum Muenchen (HMGU) and German Center for Diabetes Research (DZD), Munich 85764, Neuherberg, Germany.,Department of Informatics, TUM, Munich 85748, Garching, Germany
| | - Kinga Balázs
- Institute for Diabetes and Cancer IDC, Helmholtz Zentrum Muenchen (HMGU) and German Center for Diabetes Research (DZD), Munich 85764, Neuherberg, Germany
| | - Matthias Heinig
- Institute of Computational Biology ICB, Helmholtz Zentrum Muenchen (HMGU) and German Center for Diabetes Research (DZD), Munich 85764, Neuherberg, Germany.,Department of Informatics, TUM, Munich 85748, Garching, Germany
| | - N Henriette Uhlenhaut
- Institute for Diabetes and Cancer IDC, Helmholtz Zentrum Muenchen (HMGU) and German Center for Diabetes Research (DZD), Munich 85764, Neuherberg, Germany.,Metabolic Programming, TUM School of Life Sciences Weihenstephan, Munich 85354, Freising, Germany
| |
Collapse
|
9
|
Xiang G, Keller CA, Giardine B, An L, Li Q, Zhang Y, Hardison RC. S3norm: simultaneous normalization of sequencing depth and signal-to-noise ratio in epigenomic data. Nucleic Acids Res 2020; 48:e43. [PMID: 32086521 PMCID: PMC7192629 DOI: 10.1093/nar/gkaa105] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 01/20/2020] [Accepted: 02/10/2020] [Indexed: 12/12/2022] Open
Abstract
Quantitative comparison of epigenomic data across multiple cell types or experimental conditions is a promising way to understand the biological functions of epigenetic modifications. However, differences in sequencing depth and signal-to-noise ratios in the data from different experiments can hinder our ability to identify real biological variation from raw epigenomic data. Proper normalization is required prior to data analysis to gain meaningful insights. Most existing methods for data normalization standardize signals by rescaling either background regions or peak regions, assuming that the same scale factor is applicable to both background and peak regions. While such methods adjust for differences in sequencing depths, they do not address differences in the signal-to-noise ratios across different experiments. We developed a new data normalization method, called S3norm, that normalizes the sequencing depths and signal-to-noise ratios across different data sets simultaneously by a monotonic nonlinear transformation. We show empirically that the epigenomic data normalized by our method, compared to existing methods, can better capture real biological variation, such as impact on gene expression regulation.
Collapse
Affiliation(s)
- Guanjue Xiang
- The Bioinformatics and Genomics program, Center for Computational Biology and Bioinformatics, Huck Institutes of the Life Sciences, Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802, USA
| | - Cheryl A Keller
- Dept. of Biochemistry and Molecular Biology, The Pennsylvania State University, Wartik Laboratory, University Park, PA 16802, USA
| | - Belinda Giardine
- Dept. of Biochemistry and Molecular Biology, The Pennsylvania State University, Wartik Laboratory, University Park, PA 16802, USA
| | - Lin An
- The Bioinformatics and Genomics program, Center for Computational Biology and Bioinformatics, Huck Institutes of the Life Sciences, Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802, USA
| | - Qunhua Li
- Dept. of Statistics, The Pennsylvania State University, Wartik Laboratory, University Park, PA 16802, USA
| | - Yu Zhang
- Dept. of Statistics, The Pennsylvania State University, Wartik Laboratory, University Park, PA 16802, USA
| | - Ross C Hardison
- Dept. of Biochemistry and Molecular Biology, The Pennsylvania State University, Wartik Laboratory, University Park, PA 16802, USA
| |
Collapse
|
10
|
Borkowska J, Domaszewska-Szostek A, Kołodziej P, Wicik Z, Połosak J, Buyanovskaya O, Charzewski L, Stańczyk M, Noszczyk B, Puzianowska-Kuznicka M. Alterations in 5hmC level and genomic distribution in aging-related epigenetic drift in human adipose stem cells. Epigenomics 2020; 12:423-437. [PMID: 32031421 DOI: 10.2217/epi-2019-0131] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Aim: To clarify mechanisms affecting the level and distribution of 5-hydroxymethylcytosine (5hmC) during aging. Materials & methods: We examined levels and genomic distribution of 5hmC along with the expression of ten-eleven translocation methylcytosine dioxygenases (TETs) in adipose stem cells in young and age-advanced individuals. Results: 5hmC levels were higher in adipose stem cells of age-advanced than young individuals (p = 0.0003), but were not associated with age-related changes in expression of TETs. 5hmC levels correlated with population doubling time (r = 0.62; p = 0.01). We identified 58 differentially hydroxymethylated regions. Hypo-hydroxymethylated differentially hydroxymethylated regions were approximately twofold enriched in CCCTC-binding factor binding sites. Conclusion: Accumulation of 5hmC in aged cells can result from inefficient active demethylation due to altered TETs activity and reduced passive demethylation due to slower proliferation.
Collapse
Affiliation(s)
- Joanna Borkowska
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland
| | - Anna Domaszewska-Szostek
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland
| | - Paulina Kołodziej
- Department of Geriatrics & Gerontology, Medical Centre of Postgraduate Education, 61/63 Kleczewska Street, 01-826 Warsaw, Poland
| | - Zofia Wicik
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland
| | - Jacek Połosak
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland
| | - Olga Buyanovskaya
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland
| | - Lukasz Charzewski
- Faculty of Physics, University of Warsaw, 5 Pasteur Street, 02-093 Warsaw, Poland
| | - Marek Stańczyk
- Department of General Surgery, Wolski Hospital, 17 Kasprzaka Street, 01-211 Warsaw, Poland
| | - Bartłomiej Noszczyk
- Department of Plastic Surgery, Medical Centre of Postgraduate Education, 99/103 Marymoncka Street, 01-813 Warsaw, Poland
| | - Monika Puzianowska-Kuznicka
- Department of Human Epigenetics, Mossakowski Medical Research Centre, PAS, 5 Pawinskiego Street, 02-106 Warsaw, Poland.,Department of Geriatrics & Gerontology, Medical Centre of Postgraduate Education, 61/63 Kleczewska Street, 01-826 Warsaw, Poland
| |
Collapse
|
11
|
Cejas P, Long HW. Principles and methods of integrative chromatin analysis in primary tissues and tumors. Biochim Biophys Acta Rev Cancer 2019; 1873:188333. [PMID: 31759992 DOI: 10.1016/j.bbcan.2019.188333] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 10/22/2019] [Accepted: 10/23/2019] [Indexed: 12/16/2022]
Abstract
Recent methodological advances have enabled the genome-wide interrogation of chromatin from primary tumor tissues. Integrative analysis of histone post-translational modifications, transcription factor (TF) binding and open chromatin sites in tumors across cancer stages can elucidate the aberrant epigenetic states accompanying tumor progression. Cancer-associated chromatin alterations can activate or inactivate enhancers at genes involved in cancer while still respecting cell-of-origin constrictions. Accordingly, enhancer analysis in cancer could have uses for biomarker discovery to further refine patient diagnosis and potentially sub-classify patients for tailored therapy. Methodologies used for chromatin analyses of primary tissues need to address issues distinct from cell line studies including the specific sources of variability coming from the heterogeneous cellular composition of tissues and from inter-individual (epi)genetic differences. This leads to requirements for careful histological analysis to select the specific samples and cells of interest. In analyzing tumors somatic changes should be taken into account to distinguish the genuine epigenetic changes across tumor specimens from any genetic alterations such as copy number variations (CNV). In this contribution we review a selection of current results from chromatin profiling, examine experimental methodologies and discuss specific analysis approaches. We also review specific considerations regarding tissue preparation for epigenetic analysis and conclude with our perspectives on emerging approaches that will impact studies of chromatin landscapes of clinical samples in the future.
Collapse
Affiliation(s)
- Paloma Cejas
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA; Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA; Translational Oncology Laboratory, Hospital La Paz Institute for Health Research (IdiPAZ) and CIBERONC, La Paz University Hospital, Madrid, Spain
| | - Henry W Long
- Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA; Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA.
| |
Collapse
|
12
|
Liu CC, Steen CB, Newman AM. Computational approaches for characterizing the tumor immune microenvironment. Immunology 2019; 158:70-84. [PMID: 31347163 PMCID: PMC6742767 DOI: 10.1111/imm.13101] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Revised: 07/16/2019] [Accepted: 07/18/2019] [Indexed: 12/13/2022] Open
Abstract
Recent advances in high-throughput molecular profiling technologies and multiplexed imaging platforms have revolutionized our ability to characterize the tumor immune microenvironment. As a result, studies of tumor-associated immune cells increasingly involve complex data sets that require sophisticated methods of computational analysis. In this review, we present an overview of key assays and related bioinformatics tools for analyzing the tumor-associated immune system in bulk tissues and at the single-cell level. In parallel, we describe how data science strategies and novel technologies have advanced tumor immunology and opened the door for new opportunities to exploit host immunity to improve cancer clinical outcomes.
Collapse
Affiliation(s)
- Candace C. Liu
- Immunology Graduate ProgramSchool of MedicineStanford UniversityStanfordCAUSA
| | - Chloé B. Steen
- Division of OncologyDepartment of MedicineStanford Cancer InstituteStanford UniversityStanfordCAUSA
| | - Aaron M. Newman
- Institute for Stem Cell Biology and Regenerative MedicineStanford UniversityStanfordCAUSA
- Department of Biomedical Data ScienceStanford UniversityStanfordCAUSA
| |
Collapse
|
13
|
Song S, Cui H, Chen S, Liu Q, Jiang R. EpiFIT: functional interpretation of transcription factors based on combination of sequence and epigenetic information. QUANTITATIVE BIOLOGY 2019. [DOI: 10.1007/s40484-019-0175-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
14
|
Faux T, Rytkönen KT, Laiho A, Elo LL. RepViz: a replicate-driven R tool for visualizing genomic regions. BMC Res Notes 2019; 12:441. [PMID: 31324268 PMCID: PMC6642542 DOI: 10.1186/s13104-019-4473-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 07/12/2019] [Indexed: 11/10/2022] Open
Abstract
OBJECTIVE Visualization of sequencing data is an integral part of genomic data analysis. Although there are several tools to visualize sequencing data on genomic regions, they do not offer user-friendly ways to view simultaneously different groups of replicates. To address this need, we developed a tool that allows efficient viewing of both intra- and intergroup variation of sequencing counts on a genomic region, as well as their comparison to the output of user selected analysis methods, such as peak calling. RESULTS We present an R package RepViz for replicate-driven visualization of genomic regions. With ChIP-seq and ATAC-seq data we demonstrate its potential to aid visual inspection involved in the evaluation of normalization, outlier behavior, detected features from differential peak calling analysis, and combined analysis of multiple data types. RepViz is readily available on Bioconductor ( https://www.bioconductor.org/packages/devel/bioc/html/RepViz.html ) and on Github ( https://github.com/elolab/RepViz ).
Collapse
Affiliation(s)
- Thomas Faux
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland.
| | - Kalle T Rytkönen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland.,Institute of Biomedicine, Research Centre for Integrative Physiology and Pharmacology, University of Turku, Kiinamyllynkatu 10, 20014, Turku, Finland
| | - Asta Laiho
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520, Turku, Finland.
| |
Collapse
|