1
|
Li Q, Keskus AG, Wagner J, Izydorczyk MB, Timp W, Sedlazeck FJ, Klein AP, Zook JM, Kolmogorov M, Schatz MC. Unraveling the hidden complexity of cancer through long-read sequencing. Genome Res 2025; 35:599-620. [PMID: 40113261 PMCID: PMC12047254 DOI: 10.1101/gr.280041.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2025]
Abstract
Cancer is fundamentally a disease of the genome, characterized by extensive genomic, transcriptomic, and epigenomic alterations. Most current studies predominantly use short-read sequencing, gene panels, or microarrays to explore these alterations; however, these technologies can systematically miss or misrepresent certain types of alterations, especially structural variants, complex rearrangements, and alterations within repetitive regions. Long-read sequencing is rapidly emerging as a transformative technology for cancer research by providing a comprehensive view across the genome, transcriptome, and epigenome, including the ability to detect alterations that previous technologies have overlooked. In this Perspective, we explore the current applications of long-read sequencing for both germline and somatic cancer analysis. We provide an overview of the computational methodologies tailored to long-read data and highlight key discoveries and resources within cancer genomics that were previously inaccessible with prior technologies. We also address future opportunities and persistent challenges, including the experimental and computational requirements needed to scale to larger sample sizes, the hurdles in sequencing and analyzing complex cancer genomes, and opportunities for leveraging machine learning and artificial intelligence technologies for cancer informatics. We further discuss how the telomere-to-telomere genome and the emerging human pangenome could enhance the resolution of cancer genome analysis, potentially revolutionizing early detection and disease monitoring in patients. Finally, we outline strategies for transitioning long-read sequencing from research applications to routine clinical practice.
Collapse
Affiliation(s)
- Qiuhui Li
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Ayse G Keskus
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| | - Michal B Izydorczyk
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Winston Timp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 77030, USA
- Department of Computer Science, Rice University, Houston, Texas 77251, USA
| | - Alison P Klein
- Sidney Kimmel Comprehensive Cancer Center, Department of Oncology, Johns Hopkins Medicine, Baltimore, Maryland 21031, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| | - Mikhail Kolmogorov
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA;
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA;
- Sidney Kimmel Comprehensive Cancer Center, Department of Oncology, Johns Hopkins Medicine, Baltimore, Maryland 21031, USA
| |
Collapse
|
2
|
Koeppel J, Ferreira R, Vanderstichele T, Riedmayr LM, Peets EM, Girling G, Weller J, Murat P, Liberante FG, Ellis T, Church GM, Parts L. Randomizing the human genome by engineering recombination between repeat elements. Science 2025; 387:eado3979. [PMID: 39883775 DOI: 10.1126/science.ado3979] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/09/2024] [Indexed: 02/01/2025]
Abstract
We lack tools to edit DNA sequences at scales necessary to study 99% of the human genome that is noncoding. To address this gap, we applied CRISPR prime editing to insert recombination handles into repetitive sequences, up to 1697 per cell line, which enables generating large-scale deletions, inversions, translocations, and circular DNA. Recombinase induction produced more than 100 stochastic megabase-sized rearrangements in each cell. We tracked these rearrangements over time to measure selection pressures, finding a preference for shorter variants that avoided essential genes. We characterized 29 clones with multiple rearrangements, finding an impact of deletions on expression of genes in the variant but not on nearby genes. This genome-scrambling strategy enables large deletions, sequence relocations, and the insertion of regulatory elements to explore genome dispensability and organization.
Collapse
Affiliation(s)
| | - Raphael Ferreira
- Harvard Medical School, Department of Genetics, Boston, MA, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | | | - Lisa Maria Riedmayr
- Harvard Medical School, Department of Genetics, Boston, MA, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA
| | | | | | | | | | | | - Tom Ellis
- Wellcome Sanger Institute, Hinxton, UK
- Department of Bioengineering, Imperial College London, London, UK
| | - George McDonald Church
- Harvard Medical School, Department of Genetics, Boston, MA, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA
- Harvard-MIT Program in Health Sciences and Technology, Cambridge, Massachusetts, USA
- Blavatnik Institute, Harvard Medical School, Boston, Massachusetts, USA
| | | |
Collapse
|
3
|
Koeppel J, Weller J, Vanderstichele T, Parts L. Engineering structural variants to interrogate genome function. Nat Genet 2024; 56:2623-2635. [PMID: 39533047 DOI: 10.1038/s41588-024-01981-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Accepted: 10/10/2024] [Indexed: 11/16/2024]
Abstract
Structural variation, such as deletions, duplications, inversions and complex rearrangements, can have profound effects on gene expression, genome stability, phenotypic diversity and disease susceptibility. Structural variants can encompass up to millions of bases and have the potential to rearrange substantial segments of the genome. They contribute considerably more to genetic diversity in human populations and have larger effects on phenotypic traits than point mutations. Until recently, our understanding of the effects of structural variants was driven mainly by studying naturally occurring variation. New genome-engineering tools capable of generating deletions, insertions, inversions and translocations, together with the discovery of new recombinases and advances in creating synthetic DNA constructs, now enable the design and generation of an extended range of structural variation. Here, we discuss these tools and examples of their application and highlight existing challenges that will need to be overcome to fully harness their potential.
Collapse
|
4
|
Brunette GJ, Tourdot RW, Wangsa D, Pellman D, Zhang CZ. Haplotype-resolved reconstruction and functional interrogation of cancer karyotypes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.02.583108. [PMID: 38496539 PMCID: PMC10942333 DOI: 10.1101/2024.03.02.583108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Genomic characterization has revealed widespread structural complexity in cancer karyotypes, however shotgun sequencing cannot resolve genomic rearrangements with chromosome-length continuity. Here, we describe a two-tiered approach to determine the segmental composition of rearranged chromosomes with haplotype resolution. First, we present refLinker , a new method for robust determination of chromosomal haplotypes using cancer Hi-C data. Second, we use haplotype-specific Hi-C contacts to determine the segmental structure of rearranged chromosomes. By contrast with existing methods for diploid haplotype inference, our approach is robust to the confounding effects of large-scale DNA deletions, duplications, and high-level amplification in cancer sequencing. Using this approach, we examine haplotype-specific expression changes on rearranged homologs and provide direct evidence for long-range transcriptional activation and repression associated with rearrangements of the inactive X chromosome (Xi). Together, these results reveal the significant transcriptional consequences of somatic Xi rearrangements, highlighting refLinker 's broad utility for studying the functional consequences of chromosomal rearrangements.
Collapse
|
5
|
Yoon I, Kim U, Jung KO, Song Y, Park T, Lee DS. 3C methods in cancer research: recent advances and future prospects. Exp Mol Med 2024; 56:788-798. [PMID: 38658701 PMCID: PMC11059347 DOI: 10.1038/s12276-024-01236-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Revised: 03/15/2024] [Accepted: 03/25/2024] [Indexed: 04/26/2024] Open
Abstract
In recent years, Hi-C technology has revolutionized cancer research by elucidating the mystery of three-dimensional chromatin organization and its role in gene regulation. This paper explored the impact of Hi-C advancements on cancer research by delving into high-resolution techniques, such as chromatin loops, structural variants, haplotype phasing, and extrachromosomal DNA (ecDNA). Distant regulatory elements interact with their target genes through chromatin loops. Structural variants contribute to the development and progression of cancer. Haplotype phasing is crucial for understanding allele-specific genomic rearrangements and somatic clonal evolution in cancer. The role of ecDNA in driving oncogene amplification and drug resistance in cancer cells has also been revealed. These innovations offer a deeper understanding of cancer biology and the potential for personalized therapies. Despite these advancements, challenges, such as the accurate mapping of repetitive sequences and precise identification of structural variants, persist. Integrating Hi-C with multiomics data is key to overcoming these challenges and comprehensively understanding complex cancer genomes. Thus, Hi-C is a powerful tool for guiding precision medicine in cancer research and treatment.
Collapse
Affiliation(s)
- Insoo Yoon
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Uijin Kim
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Kyung Oh Jung
- Department of Anatomy, College of Medicine, Chung-Ang University, Seoul, 06974, Republic of Korea
| | - Yousuk Song
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Taesoo Park
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea
| | - Dong-Sung Lee
- Department of Life Science, University of Seoul, Seoul, 02504, Republic of Korea.
| |
Collapse
|