1
|
Dinh KN, Vázquez-García I, Chan A, Malhotra R, Weiner A, McPherson AW, Tavaré S. CINner: Modeling and simulation of chromosomal instability in cancer at single-cell resolution. PLoS Comput Biol 2025; 21:e1012902. [PMID: 40179124 PMCID: PMC11990800 DOI: 10.1371/journal.pcbi.1012902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 04/11/2025] [Accepted: 02/24/2025] [Indexed: 04/05/2025] Open
Abstract
Cancer development is characterized by chromosomal instability, manifesting in frequent occurrences of different genomic alteration mechanisms ranging in extent and impact. Mathematical modeling can help evaluate the role of each mutational process during tumor progression, however existing frameworks can only capture certain aspects of chromosomal instability (CIN). We present CINner, a mathematical framework for modeling genomic diversity and selection during tumor evolution. The main advantage of CINner is its flexibility to incorporate many genomic events that directly impact cellular fitness, from driver gene mutations to copy number alterations (CNAs), including focal amplifications and deletions, missegregations and whole-genome duplication (WGD). We apply CINner to find chromosome-arm selection parameters that drive tumorigenesis in the absence of WGD in chromosomally stable cancer types from the Pan-Cancer Analysis of Whole Genomes (PCAWG, [Formula: see text]). We found that the selection parameters predict WGD prevalence among different chromosomally unstable tumors, hinting that the selective advantage of WGD cells hinges on their tolerance for aneuploidy and escape from nullisomy. Analysis of inference results using CINner across cancer types in The Cancer Genome Atlas ([Formula: see text]) further reveals that the inferred selection parameters reflect the bias between tumor suppressor genes and oncogenes on specific genomic regions. Direct application of CINner to model the WGD proportion and fraction of genome altered (FGA) in PCAWG uncovers the increase in CNA probabilities associated with WGD in each cancer type. CINner can also be utilized to study chromosomally stable cancer types, by applying a selection model based on driver gene mutations and focal amplifications or deletions (chronic lymphocytic leukemia in PCAWG, [Formula: see text]). Finally, we used CINner to analyze the impact of CNA probabilities, chromosome selection parameters, tumor growth dynamics and population size on cancer fitness and heterogeneity. We expect that CINner will provide a powerful modeling tool for the oncology community to quantify the impact of newly uncovered genomic alteration mechanisms on shaping tumor progression and adaptation.
Collapse
Affiliation(s)
- Khanh N. Dinh
- Irving Institute for Cancer Dynamics, Columbia University, New York, New York, United States of America
- Department of Statistics, Columbia University, New York, New York, United States of America
| | - Ignacio Vázquez-García
- Irving Institute for Cancer Dynamics, Columbia University, New York, New York, United States of America
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
- Department of Pathology, Krantz Family Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Andrew Chan
- Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Rhea Malhotra
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
- Stanford University, Palo Alto, California, United States of America
| | - Adam Weiner
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
- Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, New York, United States of America
| | - Andrew W. McPherson
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| | - Simon Tavaré
- Irving Institute for Cancer Dynamics, Columbia University, New York, New York, United States of America
- Department of Statistics, Columbia University, New York, New York, United States of America
| |
Collapse
|
2
|
Kuipers J, Tuncel MA, Ferreira PF, Jahn K, Beerenwinkel N. Single-cell copy number calling and event history reconstruction. Bioinformatics 2025; 41:btaf072. [PMID: 39946094 PMCID: PMC11897432 DOI: 10.1093/bioinformatics/btaf072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 01/06/2025] [Accepted: 02/11/2025] [Indexed: 03/14/2025] Open
Abstract
MOTIVATION Copy number alterations are driving forces of tumour development and the emergence of intra-tumour heterogeneity. A comprehensive picture of these genomic aberrations is therefore essential for the development of personalised and precise cancer diagnostics and therapies. Single-cell sequencing offers the highest resolution for copy number profiling down to the level of individual cells. Recent high-throughput protocols allow for the processing of hundreds of cells through shallow whole-genome DNA sequencing. The resulting low read-depth data poses substantial statistical and computational challenges to the identification of copy number alterations. RESULTS We developed SCICoNE, a statistical model and MCMC algorithm tailored to single-cell copy number profiling from shallow whole-genome DNA sequencing data. SCICoNE reconstructs the history of copy number events in the tumour and uses these evolutionary relationships to identify the copy number profiles of the individual cells. We show the accuracy of this approach in evaluations on simulated data and demonstrate its practicability in applications to two breast cancer samples from different sequencing protocols. AVAILABILITY AND IMPLEMENTATION SCICoNE is available at https://github.com/cbg-ethz/SCICoNE.
Collapse
Affiliation(s)
- Jack Kuipers
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4056, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel 4056, Switzerland
| | - Mustafa Anıl Tuncel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4056, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel 4056, Switzerland
| | - Pedro F Ferreira
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4056, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel 4056, Switzerland
| | - Katharina Jahn
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4056, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel 4056, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4056, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel 4056, Switzerland
| |
Collapse
|
3
|
Weiner S, Bansal MS. DICE: fast and accurate distance-based reconstruction of single-cell copy number phylogenies. Life Sci Alliance 2025; 8:e202402923. [PMID: 39667913 PMCID: PMC11638338 DOI: 10.26508/lsa.202402923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 11/29/2024] [Accepted: 12/02/2024] [Indexed: 12/14/2024] Open
Abstract
Somatic copy number alterations (sCNAs) are valuable phylogenetic markers for inferring evolutionary relationships among tumor cell subpopulations. Advances in single-cell DNA sequencing technologies are making it possible to obtain such sCNAs datasets at ever-larger scales. However, existing methods for reconstructing phylogenies from sCNAs are often too slow for large datasets. We propose two new distance-based methods, DICE-bar and DICE-star, for reconstructing single-cell tumor phylogenies from sCNA data. Using carefully simulated datasets, we find that DICE-bar matches or exceeds the accuracies of all other methods on noise-free datasets and that DICE-star shows exceptional robustness to noise and outperforms all other methods on noisy datasets. Both methods are also orders of magnitude faster than many existing methods. Our experimental analysis also reveals how noise/error in copy number inference, as expected for real datasets, can drastically impact the accuracies of most methods. We apply DICE-star, the most accurate method on error-prone datasets, to several real single-cell breast and ovarian cancer datasets and find that it rapidly produces phylogenies of equivalent or greater reliability compared with existing methods.
Collapse
Affiliation(s)
- Samson Weiner
- School of Computing, University of Connecticut, Storrs, CT, USA
| | - Mukul S Bansal
- School of Computing, University of Connecticut, Storrs, CT, USA
- The Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
4
|
Potu T, Hu Y, Khan R, Dharani S, Ni J, Zhang L, Zhou XM, Mallory X. SCGclust: Single Cell Graph clustering using graph autoencoders integrating SNVs and CNAs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.28.635357. [PMID: 39975167 PMCID: PMC11838312 DOI: 10.1101/2025.01.28.635357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Intra-tumor heterogeneity (ITH) is a compounding factor for cancer prognosis and treatment. Single-cell DNA sequencing (scDNA-seq) provides cellular resolution of the variations in a cell and has been widely used to study cancer progression and responses to drug and treatment. While the low coverage scDNA-seq technologies typically provides a large number of cells, accurate cell clustering is essential for effectively characterizing ITH. Existing cell clustering methods typically are based on either single nucleotide variations (SNV) or copy number alterations (CNA), without leveraging both signals together. Since both SNVs and CNAs are indicative of the cell subclonality, in this paper, we designed a robust cell clustering tool that integrates both signals using a graph autoencoder. Our model co-trains the graph autoencoder and a graph convolutional network (GCN) to guanrantee meaningful clustering results and to prevent all cells from collapsing into a single cluster. Given the low dimensional embedding generated by the autoencoder, we adopted a Gaussian Mixture Model to further cluster cells. We evaluated our method on eight simulated datasets and a real cancer sample. Our results demonstrate that our method consistently achieves higher V-measure scores compared to SBMClone, a SNV-based method, and a K-means method, which relies solely on CNA signals. These findings highlight the advantage of integrating both SNV and CNA signals within a graph autoencoder framework for accurate cell clustering. SCGclust is publicly available at https://github.com/compbio-mallory/cellClustering_GNN.
Collapse
Affiliation(s)
- Teja Potu
- Department of Computer Science, Florida State University, 222 S. Copeland St. Tallahassee, 32306, Florida, United States
| | - Yunfei Hu
- Department of Computer Science, Vanderbilt University, 2201 West End Ave, Nashville, 37235, Tennessee, United States
| | - Rituparna Khan
- Department of Computer Science, Florida State University, 222 S. Copeland St. Tallahassee, 32306, Florida, United States
| | - Srinija Dharani
- Department of Computer Science, Florida State University, 222 S. Copeland St. Tallahassee, 32306, Florida, United States
| | - Jingchao Ni
- Department of Computer Science, University of Houston, 4302 University Dr, Houston, 77004, Texas, United States
| | - Liting Zhang
- Department of Computer Science, Florida State University, 222 S. Copeland St. Tallahassee, 32306, Florida, United States
| | - Xin Maizie Zhou
- Department of Computer Science, Vanderbilt University, 2201 West End Ave, Nashville, 37235, Tennessee, United States
- Department of Biomedical Engineering, Vanderbilt University, 2201 West End Ave, Nashville, 37235, Tennessee, United States
| | - Xian Mallory
- Department of Computer Science, Florida State University, 222 S. Copeland St. Tallahassee, 32306, Florida, United States
| |
Collapse
|
5
|
Grima-Terrén M, Campanario S, Ramírez-Pardo I, Cisneros A, Hong X, Perdiguero E, Serrano AL, Isern J, Muñoz-Cánoves P. Muscle aging and sarcopenia: The pathology, etiology, and most promising therapeutic targets. Mol Aspects Med 2024; 100:101319. [PMID: 39312874 DOI: 10.1016/j.mam.2024.101319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 09/13/2024] [Accepted: 09/16/2024] [Indexed: 09/25/2024]
Abstract
Sarcopenia is a progressive muscle wasting disorder that severely impacts the quality of life of elderly individuals. Although the natural aging process primarily causes sarcopenia, it can develop in response to other conditions. Because muscle function is influenced by numerous changes that occur with age, the etiology of sarcopenia remains unclear. However, recent characterizations of the aging muscle transcriptional landscape, signaling pathway disruptions, fiber and extracellular matrix compositions, systemic metabolomic and inflammatory responses, mitochondrial function, and neurological inputs offer insights and hope for future treatments. This review will discuss age-related changes in healthy muscle and our current understanding of how this can deteriorate into sarcopenia. As our elderly population continues to grow, we must understand sarcopenia and find treatments that allow individuals to maintain independence and dignity throughout an extended lifespan.
Collapse
Affiliation(s)
- Mercedes Grima-Terrén
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA; Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain
| | - Silvia Campanario
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA; Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain
| | - Ignacio Ramírez-Pardo
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA; Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain
| | - Andrés Cisneros
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA; Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain
| | - Xiaotong Hong
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA
| | | | - Antonio L Serrano
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA
| | - Joan Isern
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA
| | - Pura Muñoz-Cánoves
- Altos Labs, San Diego Institute of Science, San Diego, CA, 92121, USA; Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Barcelona, 08003, Spain.
| |
Collapse
|
6
|
Kalef-Ezra E, Turan ZG, Perez-Rodriguez D, Bomann I, Behera S, Morley C, Scholz SW, Jaunmuktane Z, Demeulemeester J, Sedlazeck FJ, Proukakis C. Single-cell somatic copy number variants in brain using different amplification methods and reference genomes. Commun Biol 2024; 7:1288. [PMID: 39384904 PMCID: PMC11464624 DOI: 10.1038/s42003-024-06940-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 09/23/2024] [Indexed: 10/11/2024] Open
Abstract
The presence of somatic mutations, including copy number variants (CNVs), in the brain is well recognized. Comprehensive study requires single-cell whole genome amplification, with several methods available, prior to sequencing. Here we compare PicoPLEX with two recent adaptations of multiple displacement amplification (MDA): primary template-directed amplification (PTA) and droplet MDA, across 93 human brain cortical nuclei. We demonstrate different properties for each, with PTA providing the broadest amplification, PicoPLEX the most even, and distinct chimeric profiles. Furthermore, we perform CNV calling on two brains with multiple system atrophy and one control brain using different reference genomes. We find that 20.6% of brain cells have at least one Mb-scale CNV, with some supported by bulk sequencing or single-cells from other brain regions. Our study highlights the importance of selecting whole genome amplification method and reference genome for CNV calling, while supporting the existence of somatic CNVs in healthy and diseased human brain.
Collapse
Affiliation(s)
- Ester Kalef-Ezra
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Zeliha Gozde Turan
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Diego Perez-Rodriguez
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Ida Bomann
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Sairam Behera
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Caoimhe Morley
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Sonja W Scholz
- Neurodegenerative Diseases Research Section, National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
- Department of Neurology, Johns Hopkins University Medical Center, Baltimore, MD, USA
| | - Zane Jaunmuktane
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
- Queen Square Brain Bank for Neurological disorders, UCL Queen Square Institute of Neurology, London, UK
| | - Jonas Demeulemeester
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
- Department of Oncology, KU Leuven, Leuven, Belgium
- Cancer Genomics Laboratory, The Francis Crick Institute, London, UK
- VIB Center for Cancer Biology, Leuven, Belgium
| | - Fritz J Sedlazeck
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA.
| |
Collapse
|
7
|
Smolka M, Paulin LF, Grochowski CM, Horner DW, Mahmoud M, Behera S, Kalef-Ezra E, Gandhi M, Hong K, Pehlivan D, Scholz SW, Carvalho CMB, Proukakis C, Sedlazeck FJ. Detection of mosaic and population-level structural variants with Sniffles2. Nat Biotechnol 2024; 42:1571-1580. [PMID: 38168980 PMCID: PMC11217151 DOI: 10.1038/s41587-023-02024-y] [Citation(s) in RCA: 92] [Impact Index Per Article: 92.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 10/11/2023] [Indexed: 01/05/2024]
Abstract
Calling structural variations (SVs) is technically challenging, but using long reads remains the most accurate way to identify complex genomic alterations. Here we present Sniffles2, which improves over current methods by implementing a repeat aware clustering coupled with a fast consensus sequence and coverage-adaptive filtering. Sniffles2 is 11.8 times faster and 29% more accurate than state-of-the-art SV callers across different coverages (5-50×), sequencing technologies (ONT and HiFi) and SV types. Furthermore, Sniffles2 solves the problem of family-level to population-level SV calling to produce fully genotyped VCF files. Across 11 probands, we accurately identified causative SVs around MECP2, including highly complex alleles with three overlapping SVs. Sniffles2 also enables the detection of mosaic SVs in bulk long-read data. As a result, we identified multiple mosaic SVs in brain tissue from a patient with multiple system atrophy. The identified SV showed a remarkable diversity within the cingulate cortex, impacting both genes involved in neuron function and repetitive elements.
Collapse
Affiliation(s)
- Moritz Smolka
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | - Luis F Paulin
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | | | - Dominic W Horner
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Sairam Behera
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA
| | - Ester Kalef-Ezra
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Mira Gandhi
- Pacific Northwest Research Institute (PNRI), Seattle, WA, USA
| | - Karl Hong
- Bionano Genomics, San Diego, CA, USA
| | - Davut Pehlivan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Division of Neurology and Developmental Neuroscience, Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
| | - Sonja W Scholz
- Neurodegenerative Diseases Research Unit, National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
- Department of Neurology, Johns Hopkins University Medical Center, Baltimore, MD, USA
| | - Claudia M B Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Pacific Northwest Research Institute (PNRI), Seattle, WA, USA
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Queen Square Institute of Neurology, University College London, London, UK
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
8
|
Zhang L, Zhou XM, Mallory X. SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data. BIOINFORMATICS (OXFORD, ENGLAND) 2024; 40:btae454. [PMID: 39067018 DOI: 10.1093/bioinformatics/btae454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 06/13/2024] [Accepted: 07/26/2024] [Indexed: 07/30/2024]
Abstract
MOTIVATION Copy number alterations (CNAs) play an important role in disease progression, especially in cancer. Single-cell DNA sequencing (scDNA-seq) facilitates the detection of CNAs of each cell that is sequenced at a shallow and uneven coverage. However, the state-of-the-art CNA detection tools based on scDNA-seq are still subject to genome-wide errors due to the wrong estimation of the ploidy. RESULTS We developed SCCNAInfer, a computational tool that utilizes the subclonal signal inside the tumor cells to more accurately infer each cell's ploidy and CNAs. Given the segmentation result of an existing CNA detection method, SCCNAInfer clusters the cells, infers the ploidy of each subclone, refines the read count by bin clustering, and accurately infers the CNAs for each cell. Both simulated and real datasets show that SCCNAInfer consistently improves upon the state-of-the-art CNA detection tools such as Aneufinder, Ginkgo, SCOPE and SeCNV. AVAILABILITY AND IMPLEMENTATION SCCNAInfer is freely available at https://github.com/compbio-mallory/SCCNAInfer. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liting Zhang
- Department of Computer Science, Florida State University, Florida 32304, USA
| | - Xin Maizie Zhou
- Department of Biomedical Engineering, Vanderbilt University, Tennessee 37235, USA
| | - Xian Mallory
- Department of Computer Science, Florida State University, Florida 32304, USA
| |
Collapse
|
9
|
Kurt S, Chen M, Toosi H, Chen X, Engblom C, Mold J, Hartman J, Lagergren J. CopyVAE: a variational autoencoder-based approach for copy number variation inference using single-cell transcriptomics. Bioinformatics 2024; 40:btae284. [PMID: 38676578 PMCID: PMC11087824 DOI: 10.1093/bioinformatics/btae284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 03/06/2024] [Accepted: 04/25/2024] [Indexed: 04/29/2024] Open
Abstract
MOTIVATION Copy number variations (CNVs) are common genetic alterations in tumour cells. The delineation of CNVs holds promise for enhancing our comprehension of cancer progression. Moreover, accurate inference of CNVs from single-cell sequencing data is essential for unravelling intratumoral heterogeneity. However, existing inference methods face limitations in resolution and sensitivity. RESULTS To address these challenges, we present CopyVAE, a deep learning framework based on a variational autoencoder architecture. Through experiments, we demonstrated that CopyVAE can accurately and reliably detect CNVs from data obtained using single-cell RNA sequencing. CopyVAE surpasses existing methods in terms of sensitivity and specificity. We also discussed CopyVAE's potential to advance our understanding of genetic alterations and their impact on disease advancement. AVAILABILITY AND IMPLEMENTATION CopyVAE is implemented and freely available under MIT license at https://github.com/kurtsemih/copyVAE.
Collapse
Affiliation(s)
- Semih Kurt
- School of EECS and SciLifeLab, KTH Royal Institute of Technology, Stockholm, 100 44, Sweden
| | - Mandi Chen
- School of EECS and SciLifeLab, KTH Royal Institute of Technology, Stockholm, 100 44, Sweden
| | - Hosein Toosi
- School of EECS and SciLifeLab, KTH Royal Institute of Technology, Stockholm, 100 44, Sweden
| | - Xinsong Chen
- Department of Oncology and Pathology, Karolinska Institutet, Solna, 171 77, Sweden
| | - Camilla Engblom
- Department of Cell and Molecular Biology, Karolinska Institutet, Solna, 171 77, Sweden
| | - Jeff Mold
- Department of Cell and Molecular Biology, Karolinska Institutet, Solna, 171 77, Sweden
| | - Johan Hartman
- Department of Oncology and Pathology, Karolinska Institutet, Solna, 171 77, Sweden
- Department of Clinical Pathology and Cytology, Karolinska University Laboratory, Solna, 171 76, Sweden
| | - Jens Lagergren
- School of EECS and SciLifeLab, KTH Royal Institute of Technology, Stockholm, 100 44, Sweden
| |
Collapse
|
10
|
Liu Y, Edrisi M, Yan Z, A Ogilvie H, Nakhleh L. NestedBD: Bayesian inference of phylogenetic trees from single-cell copy number profiles under a birth-death model. Algorithms Mol Biol 2024; 19:18. [PMID: 38685065 PMCID: PMC11059640 DOI: 10.1186/s13015-024-00264-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 03/27/2024] [Indexed: 05/02/2024] Open
Abstract
Copy number aberrations (CNAs) are ubiquitous in many types of cancer. Inferring CNAs from cancer genomic data could help shed light on the initiation, progression, and potential treatment of cancer. While such data have traditionally been available via "bulk sequencing," the more recently introduced techniques for single-cell DNA sequencing (scDNAseq) provide the type of data that makes CNA inference possible at the single-cell resolution. We introduce a new birth-death evolutionary model of CNAs and a Bayesian method, NestedBD, for the inference of evolutionary trees (topologies and branch lengths with relative mutation rates) from single-cell data. We evaluated NestedBD's performance using simulated data sets, benchmarking its accuracy against traditional phylogenetic tools as well as state-of-the-art methods. The results show that NestedBD infers more accurate topologies and branch lengths, and that the birth-death model can improve the accuracy of copy number estimation. And when applied to biological data sets, NestedBD infers plausible evolutionary histories of two colorectal cancer samples. NestedBD is available at https://github.com/Androstane/NestedBD .
Collapse
Affiliation(s)
- Yushu Liu
- Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, USA.
| | - Mohammadamin Edrisi
- Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, USA
| | - Zhi Yan
- Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, USA
| | - Huw A Ogilvie
- Department of Genetics, University of Texas MD Anderson Cancer Center, TX, 77030, Houston, USA
| | - Luay Nakhleh
- Department of Computer Science, Rice University, 6100 Main St, Houston, 77005, TX, USA
| |
Collapse
|
11
|
Dinh KN, Vázquez-García I, Chan A, Malhotra R, Weiner A, McPherson AW, Tavaré S. CINner: modeling and simulation of chromosomal instability in cancer at single-cell resolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.03.587939. [PMID: 38617259 PMCID: PMC11014621 DOI: 10.1101/2024.04.03.587939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Cancer development is characterized by chromosomal instability, manifesting in frequent occurrences of different genomic alteration mechanisms ranging in extent and impact. Mathematical modeling can help evaluate the role of each mutational process during tumor progression, however existing frameworks can only capture certain aspects of chromosomal instability (CIN). We present CINner, a mathematical framework for modeling genomic diversity and selection during tumor evolution. The main advantage of CINner is its flexibility to incorporate many genomic events that directly impact cellular fitness, from driver gene mutations to copy number alterations (CNAs), including focal amplifications and deletions, missegregations and whole-genome duplication (WGD). We apply CINner to find chromosome-arm selection parameters that drive tumorigenesis in the absence of WGD in chromosomally stable cancer types. We found that the selection parameters predict WGD prevalence among different chromosomally unstable tumors, hinting that the selective advantage of WGD cells hinges on their tolerance for aneuploidy and escape from nullisomy. Direct application of CINner to model the WGD proportion and fraction of genome altered (FGA) further uncovers the increase in CNA probabilities associated with WGD in each cancer type. CINner can also be utilized to study chromosomally stable cancer types, by applying a selection model based on driver gene mutations and focal amplifications or deletions. Finally, we used CINner to analyze the impact of CNA probabilities, chromosome selection parameters, tumor growth dynamics and population size on cancer fitness and heterogeneity. We expect that CINner will provide a powerful modeling tool for the oncology community to quantify the impact of newly uncovered genomic alteration mechanisms on shaping tumor progression and adaptation.
Collapse
Affiliation(s)
- Khanh N. Dinh
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
- Department of Statistics, Columbia University, New York, NY, USA
| | - Ignacio Vázquez-García
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Andrew Chan
- Case Western Reserve University, Cleveland, OH, USA
| | - Rhea Malhotra
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Stanford University, Palo Alto, CA, USA
| | - Adam Weiner
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Andrew W. McPherson
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Simon Tavaré
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
- Department of Statistics, Columbia University, New York, NY, USA
| |
Collapse
|
12
|
Schneider MP, Cullen AE, Pangonyte J, Skelton J, Major H, Van Oudenhove E, Garcia MJ, Chaves Urbano B, Piskorz AM, Brenton JD, Macintyre G, Markowetz F. scAbsolute: measuring single-cell ploidy and replication status. Genome Biol 2024; 25:62. [PMID: 38438920 PMCID: PMC10910719 DOI: 10.1186/s13059-024-03204-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 02/22/2024] [Indexed: 03/06/2024] Open
Abstract
Cancer cells often exhibit DNA copy number aberrations and can vary widely in their ploidy. Correct estimation of the ploidy of single-cell genomes is paramount for downstream analysis. Based only on single-cell DNA sequencing information, scAbsolute achieves accurate and unbiased measurement of single-cell ploidy and replication status, including whole-genome duplications. We demonstrate scAbsolute's capabilities using experimental cell multiplets, a FUCCI cell cycle expression system, and a benchmark against state-of-the-art methods. scAbsolute provides a robust foundation for single-cell DNA sequencing analysis across different technologies and has the potential to enable improvements in a number of downstream analyses.
Collapse
Affiliation(s)
- Michael P Schneider
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Amy E Cullen
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Justina Pangonyte
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Jason Skelton
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Harvey Major
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Elke Van Oudenhove
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Maria J Garcia
- Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | | | - Anna M Piskorz
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - James D Brenton
- University of Cambridge, Cambridge, UK
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK
| | - Geoff Macintyre
- Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Florian Markowetz
- University of Cambridge, Cambridge, UK.
- Cancer Research UK Cambridge Institute, Robinson Way, Cambridge, UK.
| |
Collapse
|
13
|
Addala V, Newell F, Pearson JV, Redwood A, Robinson BW, Creaney J, Waddell N. Computational immunogenomic approaches to predict response to cancer immunotherapies. Nat Rev Clin Oncol 2024; 21:28-46. [PMID: 37907723 DOI: 10.1038/s41571-023-00830-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/03/2023] [Indexed: 11/02/2023]
Abstract
Cancer immunogenomics is an emerging field that bridges genomics and immunology. The establishment of large-scale genomic collaborative efforts along with the development of new single-cell transcriptomic techniques and multi-omics approaches have enabled characterization of the mutational and transcriptional profiles of many cancer types and helped to identify clinically actionable alterations as well as predictive and prognostic biomarkers. Researchers have developed computational approaches and machine learning algorithms to accurately obtain clinically useful information from genomic and transcriptomic sequencing data from bulk tissue or single cells and explore tumours and their microenvironment. The rapid growth in sequencing and computational approaches has resulted in the unmet need to understand their true potential and limitations in enabling improvements in the management of patients with cancer who are receiving immunotherapies. In this Review, we describe the computational approaches currently available to analyse bulk tissue and single-cell sequencing data from cancer, stromal and immune cells, as well as how best to select the most appropriate tool to address various clinical questions and, ultimately, improve patient outcomes.
Collapse
Affiliation(s)
- Venkateswar Addala
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia.
- Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
| | - Felicity Newell
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - John V Pearson
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Alec Redwood
- National Centre for Asbestos Related Diseases, University of Western Australia, Perth, Western Australia, Australia
- Institute of Respiratory Health, Perth, Western Australia, Australia
- School of Biomedical Science, University of Western Australia, Perth, Western Australia, Australia
| | - Bruce W Robinson
- National Centre for Asbestos Related Diseases, University of Western Australia, Perth, Western Australia, Australia
- Institute of Respiratory Health, Perth, Western Australia, Australia
- Department of Respiratory Medicine, Sir Charles Gairdner Hospital, Perth, Western Australia, Australia
- Medical School, University of Western Australia, Perth, Western Australia, Australia
| | - Jenette Creaney
- National Centre for Asbestos Related Diseases, University of Western Australia, Perth, Western Australia, Australia
- Institute of Respiratory Health, Perth, Western Australia, Australia
- School of Biomedical Science, University of Western Australia, Perth, Western Australia, Australia
- Department of Respiratory Medicine, Sir Charles Gairdner Hospital, Perth, Western Australia, Australia
| | - Nicola Waddell
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia.
- Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
| |
Collapse
|
14
|
Zhang L, Bass HW, Irianto J, Mallory X. Integrating SNVs and CNAs on a phylogenetic tree from single-cell DNA sequencing data. Genome Res 2023; 33:2002-2017. [PMID: 37993137 PMCID: PMC10760445 DOI: 10.1101/gr.277249.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 10/25/2023] [Indexed: 11/24/2023]
Abstract
Single-cell DNA sequencing enables the construction of evolutionary trees that can reveal how tumors gain mutations and grow. Different whole-genome amplification procedures render genomic materials of different characteristics, often suitable for the detection of either single-nucleotide variation or copy number aberration, but not ideally for both. Consequently, this hinders the inference of a comprehensive phylogenetic tree and limits opportunities to investigate the interplay of SNVs and CNAs. Existing methods such as SCARLET and COMPASS require that the SNVs and CNAs are detected from the same sets of cells, which is technically challenging. Here we present a novel computational tool, SCsnvcna, that places SNVs on a tree inferred from CNA signals, whereas the sets of cells rendering the SNVs and CNAs are independent, offering a more practical solution in terms of the technical challenges. SCsnvcna is a Bayesian probabilistic model using both the genotype constraints on the tree and the cellular prevalence to search the optimal solution. Comprehensive simulations and comparison with seven state-of-the-art methods show that SCsnvcna is robust and accurate in a variety of circumstances. Particularly, SCsnvcna most frequently produces the lowest error rates, with ability to scale to a wide range of numerical values for leaf nodes in the tree, SNVs, and SNV cells. The application of SCsnvcna to two published colorectal cancer data sets shows highly consistent placement of SNV cells and SNVs with the original study while also supporting a refined placement of ATP7B, illustrating SCsnvcna's value in analyzing complex multitumor samples.
Collapse
Affiliation(s)
- Liting Zhang
- Department of Computer Science, Florida State University, Tallahassee, Florida 32306, USA
| | - Hank W Bass
- Department of Biological Science, Florida State University, Tallahassee, Florida 32306, USA
| | - Jerome Irianto
- College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Xian Mallory
- Department of Computer Science, Florida State University, Tallahassee, Florida 32306, USA;
| |
Collapse
|
15
|
Kalef-Ezra E, Turan ZG, Perez-Rodriguez D, Bomann I, Behera S, Morley C, Scholz SW, Jaunmuktane Z, Demeulemeester J, Sedlazeck FJ, Proukakis C. Single-cell somatic copy number variants in brain using different amplification methods and reference genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.07.552289. [PMID: 37609320 PMCID: PMC10441336 DOI: 10.1101/2023.08.07.552289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
The presence of somatic mutations, including copy number variants (CNVs), in the brain is well recognized. Comprehensive study requires single-cell whole genome amplification, with several methods available, prior to sequencing. We compared PicoPLEX with two recent adaptations of multiple displacement amplification (MDA): primary template-directed amplification (PTA) and droplet MDA, across 93 human brain cortical nuclei. We demonstrated different properties for each, with PTA providing the broadest amplification, PicoPLEX the most even, and distinct chimeric profiles. Furthermore, we performed CNV calling on two brains with multiple system atrophy and one control brain using different reference genomes. We found that 38% of brain cells have at least one Mb-scale CNV, with some supported by bulk sequencing or single-cells from other brain regions. Our study highlights the importance of selecting whole genome amplification method and reference genome for CNV calling, while supporting the existence of somatic CNVs in healthy and diseased human brain.
Collapse
Affiliation(s)
- Ester Kalef-Ezra
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
| | - Zeliha Gozde Turan
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
| | - Diego Perez-Rodriguez
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Ida Bomann
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Sairam Behera
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston TX 77030, USA
| | - Caoimhe Morley
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
| | - Sonja W. Scholz
- Neurodegenerative Diseases Research Unit, National Institute of Neurological Disorders and Stroke, Bethesda, MD, USA
- Department of Neurology, Johns Hopkins University Medical Center, Baltimore, MD, USA
| | - Zane Jaunmuktane
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
- Queen Square Brain Bank for Neurological disorders, UCL Queen Square Institute of Neurology, London, UK
| | - Jonas Demeulemeester
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
- Department of Oncology, KU Leuven, Leuven, Belgium
- Cancer Genomics Laboratory, The Francis Crick Institute, London, UK
- VIB Center for Cancer Biology, Leuven, Belgium
| | - Fritz J Sedlazeck
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, TX, USA
- Department of Computer Science, Rice University, 6100 Main Street, Houston, TX, USA
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815
| |
Collapse
|
16
|
Ramakrishnan A, Symeonidi A, Hanel P, Schmid KT, Richter ML, Schubert M, Colomé-Tatché M. epiAneufinder identifies copy number alterations from single-cell ATAC-seq data. Nat Commun 2023; 14:5846. [PMID: 37730813 PMCID: PMC10511508 DOI: 10.1038/s41467-023-41076-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 08/23/2023] [Indexed: 09/22/2023] Open
Abstract
Single-cell open chromatin profiling via scATAC-seq has become a mainstream measurement of open chromatin in single-cells. Here we present epiAneufinder, an algorithm that exploits the read count information from scATAC-seq data to extract genome-wide copy number alterations (CNAs) for individual cells, allowing the study of CNA heterogeneity present in a sample at the single-cell level. Using different cancer scATAC-seq datasets, we show that epiAneufinder can identify intratumor clonal heterogeneity in populations of single cells based on their CNA profiles. We demonstrate that these profiles are concordant with the ones inferred from single-cell whole genome sequencing data for the same samples. EpiAneufinder allows the inference of single-cell CNA information from scATAC-seq data, without the need of additional experiments, unlocking a layer of genomic variation which is otherwise unexplored.
Collapse
Affiliation(s)
- Akshaya Ramakrishnan
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Aikaterini Symeonidi
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany.
| | - Patrick Hanel
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany
| | - Katharina T Schmid
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany
| | - Maria L Richter
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany
| | - Michael Schubert
- Oncode Institute, Division of Cell Biology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, the Netherlands
| | - Maria Colomé-Tatché
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.
- Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, LMU Munich, Planegg-Martinsried, Germany.
| |
Collapse
|
17
|
Logotheti S, Papadaki E, Zolota V, Logothetis C, Vrahatis AG, Soundararajan R, Tzelepi V. Lineage Plasticity and Stemness Phenotypes in Prostate Cancer: Harnessing the Power of Integrated "Omics" Approaches to Explore Measurable Metrics. Cancers (Basel) 2023; 15:4357. [PMID: 37686633 PMCID: PMC10486655 DOI: 10.3390/cancers15174357] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/21/2023] [Accepted: 08/25/2023] [Indexed: 09/10/2023] Open
Abstract
Prostate cancer (PCa), the most frequent and second most lethal cancer type in men in developed countries, is a highly heterogeneous disease. PCa heterogeneity, therapy resistance, stemness, and lethal progression have been attributed to lineage plasticity, which refers to the ability of neoplastic cells to undergo phenotypic changes under microenvironmental pressures by switching between developmental cell states. What remains to be elucidated is how to identify measurements of lineage plasticity, how to implement them to inform preclinical and clinical research, and, further, how to classify patients and inform therapeutic strategies in the clinic. Recent research has highlighted the crucial role of next-generation sequencing technologies in identifying potential biomarkers associated with lineage plasticity. Here, we review the genomic, transcriptomic, and epigenetic events that have been described in PCa and highlight those with significance for lineage plasticity. We further focus on their relevance in PCa research and their benefits in PCa patient classification. Finally, we explore ways in which bioinformatic analyses can be used to determine lineage plasticity based on large omics analyses and algorithms that can shed light on upstream and downstream events. Most importantly, an integrated multiomics approach may soon allow for the identification of a lineage plasticity signature, which would revolutionize the molecular classification of PCa patients.
Collapse
Affiliation(s)
- Souzana Logotheti
- Department of Pathology, University of Patras, 26504 Patras, Greece; (S.L.); (E.P.); (V.Z.)
| | - Eugenia Papadaki
- Department of Pathology, University of Patras, 26504 Patras, Greece; (S.L.); (E.P.); (V.Z.)
- Department of Informatics, Ionian University, 49100 Corfu, Greece;
| | - Vasiliki Zolota
- Department of Pathology, University of Patras, 26504 Patras, Greece; (S.L.); (E.P.); (V.Z.)
| | - Christopher Logothetis
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | | | - Rama Soundararajan
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Vasiliki Tzelepi
- Department of Pathology, University of Patras, 26504 Patras, Greece; (S.L.); (E.P.); (V.Z.)
| |
Collapse
|
18
|
Perez-Rodriguez D, Kalyva M, Santucci C, Proukakis C. Somatic CNV Detection by Single-Cell Whole-Genome Sequencing in Postmortem Human Brain. Methods Mol Biol 2023; 2561:205-230. [PMID: 36399272 DOI: 10.1007/978-1-0716-2655-9_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The evidence for a role of somatic mutations, including copy-number variants (CNVs), in neurodegeneration has increased in the last decade. However, the understanding of the types and origins of these mutations, and their exact contributions to disease onset and progression, is still in its infancy. The use of single-cell (or nuclear) whole-genome sequencing (scWGS) has emerged as a powerful tool to answer these questions. In the present chapter, we provide laboratory and bioinformatic protocols used successfully in our lab to detect megabase-scale CNVs in single cells from multiple system atrophy (MSA) human postmortem brains, using immunolabeling prior to selection of nuclei for whole-genome amplification (WGA). We also present an unpublished comparison of scWGS generated from the same control substantia nigra (SN) sample, using the latest versions of popular WGA chemistries, MDA and PicoPLEX. We have used this protocol to focus on brain cell types most relevant to synucleinopathies (dopaminergic [DA] neurons in Parkinson's disease [PD] and oligodendrocytes in MSA), but it can be applied to any tissue and/or cell type with appropriate markers.
Collapse
Affiliation(s)
- Diego Perez-Rodriguez
- Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, London, UK
| | - Maria Kalyva
- Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, London, UK
| | - Catherine Santucci
- Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, London, UK
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, Queen Square Institute of Neurology, University College London, London, UK.
| |
Collapse
|
19
|
Somatic copy number variant load in neurons of healthy controls and Alzheimer's disease patients. Acta Neuropathol Commun 2022; 10:175. [PMID: 36451207 PMCID: PMC9714068 DOI: 10.1186/s40478-022-01452-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/27/2022] [Indexed: 12/03/2022] Open
Abstract
The possible role of somatic copy number variations (CNVs) in Alzheimer's disease (AD) aetiology has been controversial. Although cytogenetic studies suggested increased CNV loads in AD brains, a recent single-cell whole-genome sequencing (scWGS) experiment, studying frontal cortex brain samples, found no such evidence. Here we readdressed this issue using low-coverage scWGS on pyramidal neurons dissected via both laser capture microdissection (LCM) and fluorescence activated cell sorting (FACS) across five brain regions: entorhinal cortex, temporal cortex, hippocampal CA1, hippocampal CA3, and the cerebellum. Among reliably detected somatic CNVs identified in 1301 cells obtained from the brains of 13 AD patients and 7 healthy controls, deletions were more frequent compared to duplications. Interestingly, we observed slightly higher frequencies of CNV events in cells from AD compared to similar numbers of cells from controls (4.1% vs. 1.4%, or 0.9% vs. 0.7%, using different filtering approaches), although the differences were not statistically significant. On the technical aspects, we observed that LCM-isolated cells show higher within-cell read depth variation compared to cells isolated with FACS. To reduce within-cell read depth variation, we proposed a principal component analysis-based denoising approach that significantly improves signal-to-noise ratios. Lastly, we showed that LCM-isolated neurons in AD harbour slightly more read depth variability than neurons of controls, which might be related to the reported hyperploid profiles of some AD-affected neurons.
Collapse
|
20
|
Ruohan W, Yuwei Z, Mengbo W, Xikang F, Jianping W, Shuai Cheng L. Resolving single-cell copy number profiling for large datasets. Brief Bioinform 2022; 23:6633647. [PMID: 35801503 DOI: 10.1093/bib/bbac264] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 05/29/2022] [Accepted: 06/06/2022] [Indexed: 11/14/2022] Open
Abstract
The advances of single-cell DNA sequencing (scDNA-seq) enable us to characterize the genetic heterogeneity of cancer cells. However, the high noise and low coverage of scDNA-seq impede the estimation of copy number variations (CNVs). In addition, existing tools suffer from intensive execution time and often fail on large datasets. Here, we propose SeCNV, an efficient method that leverages structural entropy, to profile the copy numbers. SeCNV adopts a local Gaussian kernel to construct a matrix, depth congruent map (DCM), capturing the similarities between any two bins along the genome. Then, SeCNV partitions the genome into segments by minimizing the structural entropy from the DCM. With the partition, SeCNV estimates the copy numbers within each segment for cells. We simulate nine datasets with various breakpoint distributions and amplitudes of noise to benchmark SeCNV. SeCNV achieves a robust performance, i.e. the F1-scores are higher than 0.95 for breakpoint detections, significantly outperforming state-of-the-art methods. SeCNV successfully processes large datasets (>50 000 cells) within 4 min, while other tools fail to finish within the time limit, i.e. 120 h. We apply SeCNV to single-nucleus sequencing datasets from two breast cancer patients and acoustic cell tagmentation sequencing datasets from eight breast cancer patients. SeCNV successfully reproduces the distinct subclones and infers tumor heterogeneity. SeCNV is available at https://github.com/deepomicslab/SeCNV.
Collapse
Affiliation(s)
- Wang Ruohan
- Department of Computer Science at City University of Hong Kong
| | - Zhang Yuwei
- Department of Computer Science at City University of Hong Kong
| | - Wang Mengbo
- Department of Computer Science at City University of Hong Kong
| | - Feng Xikang
- School of Software, Northwestern Polytechnical University
| | - Wang Jianping
- Department of Computer Science at City University of Hong Kong
| | - Li Shuai Cheng
- Department of Computer Science at City University of Hong Kong
| |
Collapse
|
21
|
Mallory XF, Nakhleh L. SimSCSnTree: a simulator of single-cell DNA sequencing data. Bioinformatics 2022; 38:2912-2914. [PMID: 35561189 DOI: 10.1093/bioinformatics/btac169] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 02/15/2022] [Accepted: 03/18/2022] [Indexed: 11/13/2022] Open
Abstract
SUMMARY We report on a new single-cell DNA sequence simulator, SimSCSnTree, which generates an evolutionary tree of cells and evolves single nucleotide variants (SNVs) and copy number aberrations (CNAs) along its branches. Data generated by the simulator can be used to benchmark tools for single-cell genomic analyses, particularly in cancer where SNVs and CNAs are ubiquitous. AVAILABILITY AND IMPLEMENTATION SimSCSnTree is now on BioConda and also is freely available for download at https://github.com/compbiofan/SimSCSnTree.git with detailed documentation.
Collapse
Affiliation(s)
- Xian Fan Mallory
- Department of Computer Science, Florida State University, Tallahassee, FL 32306, USA
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, TX 77025, USA
| |
Collapse
|
22
|
Jia Q, Chu H, Jin Z, Long H, Zhu B. High-throughput single-сell sequencing in cancer research. Signal Transduct Target Ther 2022; 7:145. [PMID: 35504878 PMCID: PMC9065032 DOI: 10.1038/s41392-022-00990-4] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 03/23/2022] [Accepted: 04/08/2022] [Indexed: 12/22/2022] Open
Abstract
With advances in sequencing and instrument technology, bioinformatics analysis is being applied to batches of massive cells at single-cell resolution. High-throughput single-cell sequencing can be utilized for multi-omics characterization of tumor cells, stromal cells or infiltrated immune cells to evaluate tumor progression, responses to environmental perturbations, heterogeneous composition of the tumor microenvironment, and complex intercellular interactions between these factors. Particularly, single-cell sequencing of T cell receptors, alone or in combination with single-cell RNA sequencing, is useful in the fields of tumor immunology and immunotherapy. Clinical insights obtained from single-cell analysis are critically important for exploring the biomarkers of disease progression or antitumor treatment, as well as for guiding precise clinical decision-making for patients with malignant tumors. In this review, we summarize the clinical applications of single-cell sequencing in the fields of tumor cell evolution, tumor immunology, and tumor immunotherapy. Additionally, we analyze the tumor cell response to antitumor treatment, heterogeneity of the tumor microenvironment, and response or resistance to immune checkpoint immunotherapy. The limitations of single-cell analysis in cancer research are also discussed.
Collapse
Affiliation(s)
- Qingzhu Jia
- Institute of Cancer, Xinqiao Hospital, Army Medical University, Chongqing, 400037, China.,Chongqing Key Laboratory of Immunotherapy, Chongqing, 400037, China
| | - Han Chu
- Institute of Cancer, Xinqiao Hospital, Army Medical University, Chongqing, 400037, China.,Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resources and Eco-Environment, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Zheng Jin
- Research Institute, GloriousMed Clinical Laboratory Co., Ltd, Shanghai, 201318, China
| | - Haixia Long
- Institute of Cancer, Xinqiao Hospital, Army Medical University, Chongqing, 400037, China. .,Chongqing Key Laboratory of Immunotherapy, Chongqing, 400037, China.
| | - Bo Zhu
- Institute of Cancer, Xinqiao Hospital, Army Medical University, Chongqing, 400037, China. .,Chongqing Key Laboratory of Immunotherapy, Chongqing, 400037, China.
| |
Collapse
|
23
|
Yang L, Gao Y, Oswalt A, Fang L, Boschiero C, Neupane M, Sattler CG, Li CJ, Seroussi E, Xu L, Yang L, Li L, Zhang H, Rosen BD, Van Tassell CP, Zhou Y, Ma L, Liu GE. Towards the detection of copy number variation from single sperm sequencing in cattle. BMC Genomics 2022; 23:215. [PMID: 35300589 PMCID: PMC8928590 DOI: 10.1186/s12864-022-08441-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 10/15/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Copy number variation (CNV) has been routinely studied using bulk-cell sequencing. However, CNV is not well studied on the single-cell level except for humans and a few model organisms. RESULTS We sequenced 143 single sperms of two Holstein bulls, from which we predicted CNV events using 14 single sperms with deep sequencing. We then compared the CNV results derived from single sperms with the bulk-cell sequencing of one bull's family trio of diploid genomes. As a known CNV hotspot, segmental duplications were also predicted using the bovine ARS-UCD1.2 genome. Although the trio CNVs validated only some single sperm CNVs, they still showed a distal chromosomal distribution pattern and significant associations with segmental duplications and satellite repeats. CONCLUSION Our preliminary results pointed out future research directions and highlighted the importance of uniform whole genome amplification, deep sequence coverage, and dedicated software pipelines for CNV detection using single cell sequencing data.
Collapse
Affiliation(s)
- Liu Yang
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.,College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, 611130, China
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.,Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Adam Oswalt
- Select Sires Inc, 11740 U.S. 42 North, Plain City, OH, 43064, USA
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Clarissa Boschiero
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Mahesh Neupane
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | | | - Cong-Jun Li
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Eyal Seroussi
- Agricultural Research Organization (ARO), Institute of Animal Science, HaMaccabim Road, P.O.B 15159, 7528809, Volcani CenterRishon LeTsiyon, Israel
| | - Lingyang Xu
- Innovation Team of Cattle Genetic Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lv Yang
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Li Li
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, 611130, China
| | - Hongping Zhang
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, 611130, China
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Curtis P Van Tassell
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Yang Zhou
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA.
| |
Collapse
|
24
|
Kozlov A, Alves JM, Stamatakis A, Posada D. CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data. Genome Biol 2022; 23:37. [PMID: 35081992 PMCID: PMC8790911 DOI: 10.1186/s13059-021-02583-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 12/20/2021] [Indexed: 01/15/2023] Open
Abstract
We introduce CellPhy, a maximum likelihood framework for inferring phylogenetic trees from somatic single-cell single-nucleotide variants. CellPhy leverages a finite-site Markov genotype model with 16 diploid states and considers amplification error and allelic dropout. We implement CellPhy into RAxML-NG, a widely used phylogenetic inference package that provides statistical confidence measurements and scales well on large datasets with hundreds or thousands of cells. Comprehensive simulations suggest that CellPhy is more robust to single-cell genomics errors and outperforms state-of-the-art methods under realistic scenarios, both in accuracy and speed. CellPhy is freely available at https://github.com/amkozlov/cellphy .
Collapse
Affiliation(s)
- Alexey Kozlov
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, 69118 Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany
| | - Joao M. Alves
- CINBIO, Universidade de Vigo, 36310 Vigo, Spain
- Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, Vigo, Spain
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, 69118 Heidelberg, Germany
| | - David Posada
- CINBIO, Universidade de Vigo, 36310 Vigo, Spain
- Department of Biochemistry, Genetics, and Immunology, Universidade de Vigo, 36310 Vigo, Spain
- Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, Vigo, Spain
| |
Collapse
|
25
|
Reconstructing and counting genomic fragments through tagmentation-based haploid phasing. Sci Rep 2021; 11:18907. [PMID: 34556684 PMCID: PMC8460729 DOI: 10.1038/s41598-021-97852-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 08/27/2021] [Indexed: 11/16/2022] Open
Abstract
Single-cell sequencing provides a new level of granularity in studying the heterogeneous nature of cancer cells. For some cancers, this heterogeneity is the result of copy number changes of genes within the cellular genomes. The ability to accurately determine such copy number changes is critical in tracing and understanding tumorigenesis. Current single-cell genome sequencing methodologies infer copy numbers based on statistical approaches followed by rounding decimal numbers to integer values. Such methodologies are sample dependent, have varying calling sensitivities which heavily depend on the sample’s ploidy and are sensitive to noise in sequencing data. In this paper we have demonstrated the concept of integer-counting by using a novel bioinformatic algorithm built on our library construction chemistry in order to detect the discrete nature of the genome.
Collapse
|
26
|
Montemurro M, Grassi E, Pizzino CG, Bertotti A, Ficarra E, Urgese G. PhyliCS: a Python library to explore scCNA data and quantify spatial tumor heterogeneity. BMC Bioinformatics 2021; 22:360. [PMID: 34217219 PMCID: PMC8254361 DOI: 10.1186/s12859-021-04277-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 06/21/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Tumors are composed by a number of cancer cell subpopulations (subclones), characterized by a distinguishable set of mutations. This phenomenon, known as intra-tumor heterogeneity (ITH), may be studied using Copy Number Aberrations (CNAs). Nowadays ITH can be assessed at the highest possible resolution using single-cell DNA (scDNA) sequencing technology. Additionally, single-cell CNA (scCNA) profiles from multiple samples of the same tumor can in principle be exploited to study the spatial distribution of subclones within a tumor mass. However, since the technology required to generate large scDNA sequencing datasets is relatively recent, dedicated analytical approaches are still lacking. RESULTS We present PhyliCS, the first tool which exploits scCNA data from multiple samples from the same tumor to estimate whether the different clones of a tumor are well mixed or spatially separated. Starting from the CNA data produced with third party instruments, it computes a score, the Spatial Heterogeneity score, aimed at distinguishing spatially intermixed cell populations from spatially segregated ones. Additionally, it provides functionalities to facilitate scDNA analysis, such as feature selection and dimensionality reduction methods, visualization tools and a flexible clustering module. CONCLUSIONS PhyliCS represents a valuable instrument to explore the extent of spatial heterogeneity in multi-regional tumour sampling, exploiting the potential of scCNA data.
Collapse
Affiliation(s)
- Marilisa Montemurro
- Department of Control and Computer Science, Politecnico di Torino, C.so Duca degli Abruzzi 24, 10129, Turin, Italy.
| | - Elena Grassi
- Department of Oncology, University of Torino, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, Turin, Italy.,Candiolo Cancer Institute - FPO IRCCS, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, TO, Italy
| | - Carmelo Gabriele Pizzino
- Department of Oncology, University of Torino, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, Turin, Italy.,Candiolo Cancer Institute - FPO IRCCS, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, TO, Italy
| | - Andrea Bertotti
- Department of Oncology, University of Torino, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, Turin, Italy.,Candiolo Cancer Institute - FPO IRCCS, Strada Provinciale, 142 - KM 3.95, 10060, Candiolo, TO, Italy
| | - Elisa Ficarra
- Enzo Ferrari Engineering Dept, University of Modena and Reggio Emilia, Via Vivarelli 10/1, 41125, Modena, Italy
| | - Gianvito Urgese
- Interuniversity Department of Regional and Urban Studies and Planning, Politecnico di Torino, C.so Duca degli Abruzzi 24, 10129, Turin, Italy
| |
Collapse
|
27
|
Prasad S, Rankine A, Prasad T, Song P, Dokukin ME, Makarova N, Backman V, Sokolov I. Atomic Force Microscopy Detects the Difference in Cancer Cells of Different Neoplastic Aggressiveness via Machine Learning. ADVANCED NANOBIOMED RESEARCH 2021. [DOI: 10.1002/anbr.202000116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Affiliation(s)
- Siona Prasad
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
- Department of Computer Science Harvard University Cambridge MA 02138 USA
| | - Alex Rankine
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
- Department of Computer Science Harvard University Cambridge MA 02138 USA
| | - Tarun Prasad
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
- Department of Computer Science Harvard University Cambridge MA 02138 USA
| | - Patrick Song
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
- Department of Computer Science Harvard University Cambridge MA 02138 USA
| | - Maxim E. Dokukin
- NanoScience Solutions, Inc Arlington VA 22203 USA
- Department of Information Technology and Electronics Sarov Physics and Technology Institute Sarov Russian Federation
- Institute of Nanoengineering in Electronics, Spintronics and Photonics National Research Nuclear University MEPhI Moscow Russian Federation
| | - Nadezda Makarova
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
| | - Vadim Backman
- Department of Biomedical Engineering Northwestern University Evanston IL 60208 USA
| | - Igor Sokolov
- Department of Mechanical Engineering Tufts University Medford MA 02155 USA
- Department of Biomedical Engineering Tufts University Medford MA 02155 USA
- Department of Physics Tufts University Medford MA 02155 USA
| |
Collapse
|
28
|
Afrin K, Iquebal AS, Karimi M, Souris A, Lee SY, Mallick BK. Directionally dependent multi-view clustering using copula model. PLoS One 2020; 15:e0238996. [PMID: 33095785 PMCID: PMC7584221 DOI: 10.1371/journal.pone.0238996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 08/27/2020] [Indexed: 11/23/2022] Open
Abstract
Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.
Collapse
Affiliation(s)
- Kahkashan Afrin
- Department of Industrial & Systems Engineering, Texas A&M University, College Station, TX, United States of America
| | - Ashif S. Iquebal
- Department of Industrial & Systems Engineering, Texas A&M University, College Station, TX, United States of America
| | - Mostafa Karimi
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, United States of America
| | - Allyson Souris
- Department of Statistics, Texas A&M University, College Station, TX, United States of America
| | - Se Yoon Lee
- Department of Statistics, Texas A&M University, College Station, TX, United States of America
| | - Bani K. Mallick
- Department of Statistics, Texas A&M University, College Station, TX, United States of America
| |
Collapse
|
29
|
Mallory XF, Edrisi M, Navin N, Nakhleh L. Methods for copy number aberration detection from single-cell DNA-sequencing data. Genome Biol 2020; 21:208. [PMID: 32807205 PMCID: PMC7433197 DOI: 10.1186/s13059-020-02119-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Accepted: 07/23/2020] [Indexed: 02/06/2023] Open
Abstract
Copy number aberrations (CNAs), which are pathogenic copy number variations (CNVs), play an important role in the initiation and progression of cancer. Single-cell DNA-sequencing (scDNAseq) technologies produce data that is ideal for inferring CNAs. In this review, we review eight methods that have been developed for detecting CNAs in scDNAseq data, and categorize them according to the steps of a seven-step pipeline that they employ. Furthermore, we review models and methods for evolutionary analyses of CNAs from scDNAseq data and highlight advances and future research directions for computational methods for CNA detection from scDNAseq data.
Collapse
Affiliation(s)
- Xian F. Mallory
- Department of Computer Science, Rice University, Houston, TX USA
- Department of Computer Science, Florida State University, Tallahassee, FL USA
| | | | - Nicholas Navin
- Department of Genetics, the University of Texas M.D. Anderson Cancer Center, Houston, TX USA
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, TX USA
| |
Collapse
|