1
|
Sandmann S, Richter S, Jiang X, Varghese J. Reconstructing Clonal Evolution-A Systematic Evaluation of Current Bioinformatics Approaches. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:5128. [PMID: 36982036 PMCID: PMC10049679 DOI: 10.3390/ijerph20065128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 03/04/2023] [Accepted: 03/13/2023] [Indexed: 06/18/2023]
Abstract
The accurate reconstruction of clonal evolution, including the identification of newly developing, highly aggressive subclones, is essential for the application of precision medicine in cancer treatment. Reconstruction, aiming for correct variant clustering and clonal evolution tree reconstruction, is commonly performed by tedious manual work. While there is a plethora of tools to automatically generate reconstruction, their reliability, especially reasons for unreliability, are not systematically assessed. We developed clevRsim-an approach to simulate clonal evolution data, including single-nucleotide variants as well as (overlapping) copy number variants. From this, we generated 88 data sets and performed a systematic evaluation of the tools for the reconstruction of clonal evolution. The results indicate a major negative influence of a high number of clones on both clustering and tree reconstruction. Low coverage as well as an extreme number of time points usually leads to poor clustering results. An underlying branched independent evolution hampers correct tree reconstruction. A further major decline in performance could be observed for large deletions and duplications overlapping single-nucleotide variants. In summary, to explore the full potential of reconstructing clonal evolution, improved algorithms that can properly handle the identified limitations are greatly needed.
Collapse
Affiliation(s)
- Sarah Sandmann
- Institute of Medical Informatics, University of Münster, 48149 Münster, Germany
| | - Silja Richter
- Institute of Medical Informatics, University of Münster, 48149 Münster, Germany
| | - Xiaoyi Jiang
- Department of Computer Science, University of Münster, 48149 Münster, Germany
| | - Julian Varghese
- Institute of Medical Informatics, University of Münster, 48149 Münster, Germany
| |
Collapse
|
2
|
Zaragoza-Infante L, Junet V, Pechlivanis N, Fragkouli SC, Amprachamian S, Koletsa T, Chatzidimitriou A, Papaioannou M, Stamatopoulos K, Agathangelidis A, Psomopoulos F. IgIDivA: immunoglobulin intraclonal diversification analysis. Brief Bioinform 2022; 23:bbac349. [PMID: 36044248 PMCID: PMC9487589 DOI: 10.1093/bib/bbac349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 07/25/2022] [Accepted: 07/26/2022] [Indexed: 11/14/2022] Open
Abstract
Intraclonal diversification (ID) within the immunoglobulin (IG) genes expressed by B cell clones arises due to ongoing somatic hypermutation (SHM) in a context of continuous interactions with antigen(s). Defining the nature and order of appearance of SHMs in the IG genes can assist in improved understanding of the ID process, shedding light into the ontogeny and evolution of B cell clones in health and disease. Such endeavor is empowered thanks to the introduction of high-throughput sequencing in the study of IG gene repertoires. However, few existing tools allow the identification, quantification and characterization of SHMs related to ID, all of which have limitations in their analysis, highlighting the need for developing a purpose-built tool for the comprehensive analysis of the ID process. In this work, we present the immunoglobulin intraclonal diversification analysis (IgIDivA) tool, a novel methodology for the in-depth qualitative and quantitative analysis of the ID process from high-throughput sequencing data. IgIDivA identifies and characterizes SHMs that occur within the variable domain of the rearranged IG genes and studies in detail the connections between identified SHMs, establishing mutational pathways. Moreover, it combines established and new graph-based metrics for the objective determination of ID level, combined with statistical analysis for the comparison of ID level features for different groups of samples. Of importance, IgIDivA also provides detailed visualizations of ID through the generation of purpose-built graph networks. Beyond the method design, IgIDivA has been also implemented as an R Shiny web application. IgIDivA is freely available at https://bio.tools/igidiva.
Collapse
Affiliation(s)
- Laura Zaragoza-Infante
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
- Hematology Unit, 1st Dept of Internal Medicine, Aristotle University of Thessaloniki, AHEPA Hospital, Thessaloniki
| | - Valentin Junet
- Anaxomics Biotech SL, Barcelona, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Nikos Pechlivanis
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| | | | - Serovpe Amprachamian
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| | | | - Anastasia Chatzidimitriou
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
- Department of Molecular Medicine and Surgery, Karolinska Institute, Stockholm, Sweden
| | - Maria Papaioannou
- Hematology Unit, 1st Dept of Internal Medicine, Aristotle University of Thessaloniki, AHEPA Hospital, Thessaloniki
| | - Kostas Stamatopoulos
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
- Department of Molecular Medicine and Surgery, Karolinska Institute, Stockholm, Sweden
| | - Andreas Agathangelidis
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
- Faculty of Biology, National and Kapodistrian University of Athens, Athens, Greece
| | - Fotis Psomopoulos
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| |
Collapse
|
3
|
Sandmann S, Richter S, Jiang X, Varghese J. Exploring Current Challenges and Perspectives for Automatic Reconstruction of Clonal Evolution. Cancer Genomics Proteomics 2022; 19:194-204. [PMID: 35181588 DOI: 10.21873/cgp.20314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/11/2021] [Accepted: 12/17/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND/AIM In the field of cancer research, reconstructing clonal evolution is of major interest. The technique provides new insights for analysis and prediction of tumor development. However, reconstruction based on mutational data is characterized by several challenges. MATERIALS AND METHODS By performing extensive literature research, we identified 51 currently available tools for reconstructing clonal evolution. By analyzing two cancer data sets (n=21), we investigated the applicability and performance of each tool. RESULTS Seventeen out of 51 tools could be applied to our data. Correct clustering of variants can be observed for 4 patients in the presence of ≤3 clusters and ≥5 time points. Correct phylogenetic trees are determined for 10 patients. Accurate visualization is possible, by applying adjustments to the original algorithms. CONCLUSION Despite bearing considerable potential, automatic reconstruction of clonal evolution remains challenging. To replace tedious manual reconstruction, further research including systematic error analyses using simulation tools needs to be conducted.
Collapse
Affiliation(s)
- Sarah Sandmann
- Institute of Medical Informatics, University of Münster, Münster, Germany;
| | - Silja Richter
- Institute of Medical Informatics, University of Münster, Münster, Germany
| | - Xiaoyi Jiang
- Department of Computer Science, University of Münster, Münster, Germany
| | - Julian Varghese
- Institute of Medical Informatics, University of Münster, Münster, Germany
| |
Collapse
|
4
|
Utro F, Levovitz C, Rhrissorrakrai K, Parida L. A common methodological phylogenomics framework for intra-patient heteroplasmies to infer SARS-CoV-2 sublineages and tumor clones. BMC Genomics 2021; 22:518. [PMID: 34789161 PMCID: PMC8596094 DOI: 10.1186/s12864-021-07660-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Accepted: 04/28/2021] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND All diseases containing genetic material undergo genetic evolution and give rise to heterogeneity including cancer and infection. Although these illnesses are biologically very different, the ability for phylogenetic retrodiction based on the genomic reads is common between them and thus tree-based principles and assumptions are shared. Just as the different frequencies of tumor genomic variants presupposes the existence of multiple tumor clones and provides a handle to computationally infer them, we postulate that the different variant frequencies in viral reads offers the means to infer multiple co-infecting sublineages. RESULTS We present a common methodological framework to infer the phylogenomics from genomic data, be it reads of SARS-CoV-2 of multiple COVID-19 patients or bulk DNAseq of the tumor of a cancer patient. We describe the Concerti computational framework for inferring phylogenies in each of the two scenarios.To demonstrate the accuracy of the method, we reproduce some known results in both scenarios. We also make some additional discoveries. CONCLUSIONS Concerti successfully extracts and integrates information from multi-point samples, enabling the discovery of clinically plausible phylogenetic trees that capture the heterogeneity known to exist both spatially and temporally. These models can have direct therapeutic implications by highlighting "birth" of clones that may harbor resistance mechanisms to treatment, "death" of subclones with drug targets, and acquisition of functionally pertinent mutations in clones that may have seemed clinically irrelevant. Specifically in this paper we uncover new potential parallel mutations in the evolution of the SARS-CoV-2 virus. In the context of cancer, we identify new clones harboring resistant mutations to therapy.
Collapse
Affiliation(s)
- Filippo Utro
- IBM Research, T.J. Watson Research Center, Yorktown Heights, USA
| | - Chaya Levovitz
- IBM Research, T.J. Watson Research Center, Yorktown Heights, USA
| | | | - Laxmi Parida
- IBM Research, T.J. Watson Research Center, Yorktown Heights, USA
| |
Collapse
|
5
|
Behringer MG. Multi-omic Characterization of Intraspecies Variation in Laboratory and Natural Environments. mSystems 2021; 6:e0076421. [PMID: 34427516 PMCID: PMC8409731 DOI: 10.1128/msystems.00764-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Investigation of microbial communities has led to many advances in our understanding of ecosystem function, whether that ecosystem is a subglacial lake or the human gut. Within these communities, much emphasis has been placed on interspecific variation and between-species relationships. However, with current advances in sequencing technology resulting in both the reduction in sequencing costs and the rise of shotgun metagenomic sequencing, the importance of intraspecific variation and within-species relationships is becoming realized. Our group conducts multi-omic analyses to understand how spatial structure and resource availability influence diversification within a species and the potential for long-term coexistence of multiple ecotypes within a microbial community. Here, we present examples of ecotypic variation observed in the lab and in the wild, current challenges faced when investigating intraspecies diversity, and future developments that we expect to define the field over the next 5 years.
Collapse
Affiliation(s)
- Megan G. Behringer
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Vanderbilt Microbiome Initiative, Nashville, Tennessee, USA
| |
Collapse
|