1
|
Li Q, Keskus AG, Wagner J, Izydorczyk MB, Timp W, Sedlazeck FJ, Klein AP, Zook JM, Kolmogorov M, Schatz MC. Unraveling the hidden complexity of cancer through long-read sequencing. Genome Res 2025; 35:599-620. [PMID: 40113261 PMCID: PMC12047254 DOI: 10.1101/gr.280041.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2025]
Abstract
Cancer is fundamentally a disease of the genome, characterized by extensive genomic, transcriptomic, and epigenomic alterations. Most current studies predominantly use short-read sequencing, gene panels, or microarrays to explore these alterations; however, these technologies can systematically miss or misrepresent certain types of alterations, especially structural variants, complex rearrangements, and alterations within repetitive regions. Long-read sequencing is rapidly emerging as a transformative technology for cancer research by providing a comprehensive view across the genome, transcriptome, and epigenome, including the ability to detect alterations that previous technologies have overlooked. In this Perspective, we explore the current applications of long-read sequencing for both germline and somatic cancer analysis. We provide an overview of the computational methodologies tailored to long-read data and highlight key discoveries and resources within cancer genomics that were previously inaccessible with prior technologies. We also address future opportunities and persistent challenges, including the experimental and computational requirements needed to scale to larger sample sizes, the hurdles in sequencing and analyzing complex cancer genomes, and opportunities for leveraging machine learning and artificial intelligence technologies for cancer informatics. We further discuss how the telomere-to-telomere genome and the emerging human pangenome could enhance the resolution of cancer genome analysis, potentially revolutionizing early detection and disease monitoring in patients. Finally, we outline strategies for transitioning long-read sequencing from research applications to routine clinical practice.
Collapse
Affiliation(s)
- Qiuhui Li
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Ayse G Keskus
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| | - Michal B Izydorczyk
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Winston Timp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 77030, USA
- Department of Computer Science, Rice University, Houston, Texas 77251, USA
| | - Alison P Klein
- Sidney Kimmel Comprehensive Cancer Center, Department of Oncology, Johns Hopkins Medicine, Baltimore, Maryland 21031, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| | - Mikhail Kolmogorov
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland 20892, USA;
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA;
- Sidney Kimmel Comprehensive Cancer Center, Department of Oncology, Johns Hopkins Medicine, Baltimore, Maryland 21031, USA
| |
Collapse
|
2
|
Thomson A, Rehn J, Yeung D, Breen J, White D. Deciphering IGH rearrangement complexity and detection strategies in acute lymphoblastic leukaemia. NPJ Precis Oncol 2025; 9:99. [PMID: 40185891 PMCID: PMC11971345 DOI: 10.1038/s41698-025-00887-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2024] [Accepted: 03/19/2025] [Indexed: 04/07/2025] Open
Abstract
Acute lymphoblastic leukaemia is a highly heterogeneous malignancy characterised by various genomic alterations that influence disease progression and therapeutic outcomes. Gene fusions involving the immunoglobulin heavy chain gene represent a complex and diverse category. These fusions often result in enhancer hijacking, upregulation of partner proto-oncogenes and contribute to leukemogenesis. This review highlights the mechanisms underlying IGH gene fusions, the critical role they play in ALL pathogenesis, and current detection technologies.
Collapse
Affiliation(s)
- Ashlee Thomson
- Faculty of Health and Medical Sciences, University of Adelaide, Adelaide, SA, 5005, Australia.
- Blood Cancer Program, Precision Cancer Medicine Theme, South Australian Health & Medical Research Institute (SAHMRI), Adelaide, SA, 5000, Australia.
| | - Jacqueline Rehn
- Faculty of Health and Medical Sciences, University of Adelaide, Adelaide, SA, 5005, Australia
- Blood Cancer Program, Precision Cancer Medicine Theme, South Australian Health & Medical Research Institute (SAHMRI), Adelaide, SA, 5000, Australia
| | - David Yeung
- Faculty of Health and Medical Sciences, University of Adelaide, Adelaide, SA, 5005, Australia
- Blood Cancer Program, Precision Cancer Medicine Theme, South Australian Health & Medical Research Institute (SAHMRI), Adelaide, SA, 5000, Australia
- Haematology Department, Royal Adelaide Hospital and SA Pathology, Adelaide, SA, 5000, Australia
| | - James Breen
- Black Ochre Data Labs, Indigenous Genomics, The Kids Research Institute Australia, Adelaide, SA, 5000, Australia
- James Curtin School of Medical Research, Australian National University, Canberra, ACT, 2601, Australia
| | - Deborah White
- Faculty of Health and Medical Sciences, University of Adelaide, Adelaide, SA, 5005, Australia.
- Blood Cancer Program, Precision Cancer Medicine Theme, South Australian Health & Medical Research Institute (SAHMRI), Adelaide, SA, 5000, Australia.
- Australian and New Zealand Children's Oncology Group (ANZCHOG), Clayton, VIC, 3168, Australia.
- Australian Genomics Health Alliance (AGHA), The Murdoch Children's Research Institute, Parkville, VIC, 3052, Australia.
| |
Collapse
|
3
|
Han H, Lee HH, Kim MG, Shin YS, Chung JS, Kim J. Genome assembly resources of genitourinary cancers for chromosomal aberration at the single nucleotide level. Sci Data 2025; 12:550. [PMID: 40169664 PMCID: PMC11962096 DOI: 10.1038/s41597-025-04801-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/11/2025] [Indexed: 04/03/2025] Open
Abstract
Traditionally, the evolutionary perspective of cancer has been understood as gradual alterations in passenger/driver genes that lead to branching phylogeny. However, in cases of prostate adenocarcinoma and kidney renal cell carcinoma, macroevolutionary landmarks like chromoplexy and chromothripsis are frequently observed. Unfortunately, short-read sequencing techniques often miss these significant macroevolutionary changes, which involve multiple translocations and deletions at the chromosomal level. To resolve such genomic dark matters, we provided high-fidelity long-read sequencing data (78-92 Gb of ~Q30 reads) of six genitourinary tumour cell lines (one benign kidney tumour and two kidney and three prostate cancers). Based on these data, we obtained 12 high-quality, partially phased genome assemblies (Contig N50 1.85-29.01 Mb; longest contig 2.02-171.62 Mb), graph-based pan-genome variant sets (11.57 M variants including 60 K structural variants), and 5-methylcytosine sites (14.68%-27.05% of the CpG sites). We also identified several severe chromosome aberration events, which would result from chromosome break and fusion events. Our cancer genome assemblies will provide unprecedented resolution to understand cancer genome instability and chromosomal aberration.
Collapse
Affiliation(s)
- Hyunho Han
- Department of Urology, Urological Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea.
| | - Hyung Ho Lee
- Center for Urologic Cancer, National Cancer Center, Goyang, Republic of Korea
| | - Min Gyu Kim
- Center for Urologic Cancer, National Cancer Center, Goyang, Republic of Korea
| | - Yoo Sub Shin
- Department of Urology, Urological Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jin Soo Chung
- Center for Urologic Cancer, National Cancer Center, Goyang, Republic of Korea.
| | - Jun Kim
- Department of Convergent Bioscience and Informatics, College of Bioscience and Biotechnology, Chungnam National University, Daejeon, 34134, Korea.
| |
Collapse
|
4
|
An Z, Jiang A, Chen J. Toward understanding the role of genomic repeat elements in neurodegenerative diseases. Neural Regen Res 2025; 20:646-659. [PMID: 38886931 PMCID: PMC11433896 DOI: 10.4103/nrr.nrr-d-23-01568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 12/21/2023] [Accepted: 03/02/2024] [Indexed: 06/20/2024] Open
Abstract
Neurodegenerative diseases cause great medical and economic burdens for both patients and society; however, the complex molecular mechanisms thereof are not yet well understood. With the development of high-coverage sequencing technology, researchers have started to notice that genomic repeat regions, previously neglected in search of disease culprits, are active contributors to multiple neurodegenerative diseases. In this review, we describe the association between repeat element variants and multiple degenerative diseases through genome-wide association studies and targeted sequencing. We discuss the identification of disease-relevant repeat element variants, further powered by the advancement of long-read sequencing technologies and their related tools, and summarize recent findings in the molecular mechanisms of repeat element variants in brain degeneration, such as those causing transcriptional silencing or RNA-mediated gain of toxic function. Furthermore, we describe how in silico predictions using innovative computational models, such as deep learning language models, could enhance and accelerate our understanding of the functional impact of repeat element variants. Finally, we discuss future directions to advance current findings for a better understanding of neurodegenerative diseases and the clinical applications of genomic repeat elements.
Collapse
Affiliation(s)
- Zhengyu An
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Aidi Jiang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Jingqi Chen
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Zhangjiang Fudan International Innovation Center, Shanghai, China
| |
Collapse
|
5
|
Yang S, Seo J, Choi J, Kim SH, Kuk Y, Park KC, Kang M, Byun S, Joo JY. Towards understanding cancer dormancy over strategic hitching up mechanisms to technologies. Mol Cancer 2025; 24:47. [PMID: 39953555 PMCID: PMC11829473 DOI: 10.1186/s12943-025-02250-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2024] [Accepted: 01/28/2025] [Indexed: 02/17/2025] Open
Abstract
Delving into cancer dormancy has been an inherent task that may drive the lethal recurrence of cancer after primary tumor relief. Cells in quiescence can survive for a short or long term in silence, may undergo genetic or epigenetic changes, and can initiate relapse through certain contextual cues. The state of dormancy can be induced by multiple conditions including cancer drug treatment, in turn, undergoes a life cycle that generally occurs through dissemination, invasion, intravasation, circulation, immune evasion, extravasation, and colonization. Throughout this cascade, a cellular machinery governs the fate of individual cells, largely affected by gene regulation. Despite its significance, a precise view of cancer dormancy is yet hampered. Revolutionizing advanced single cell and long read sequencing through analysis methodologies and artificial intelligence, the most recent stage in the research tool progress, is expected to provide a holistic view of the diverse aspects of cancer dormancy.
Collapse
Affiliation(s)
- Sumin Yang
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Korea
| | - Jieun Seo
- Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea
- Department of Functional Genomics, University of Science and Technology, Daejeon, 34113, Korea
| | - Jeonghyeon Choi
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Korea
| | - Sung-Hyun Kim
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Korea
| | - Yunmin Kuk
- Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea
- Department of Functional Genomics, University of Science and Technology, Daejeon, 34113, Korea
| | - Kyung Chan Park
- Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea
- Department of Functional Genomics, University of Science and Technology, Daejeon, 34113, Korea
| | - Mingon Kang
- Department of Computer Science, University of Nevada, Las Vegas, NV, 89154, USA
| | - Sangwon Byun
- Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea.
- Department of Functional Genomics, University of Science and Technology, Daejeon, 34113, Korea.
| | - Jae-Yeol Joo
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Korea.
- Department of Pharmacy, College of Pharmacy, Hanyang University, Rm 407, Bldg.42, 55 Hanyangdaehak-ro, Sangnok-gu Ansan, Gyeonggi-do, 15588, Republic of Korea.
| |
Collapse
|
6
|
Engelbrecht E, Rodriguez OL, Watson CT. Addressing Technical Pitfalls in Pursuit of Molecular Factors That Mediate Immunoglobulin Gene Regulation. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2024; 213:651-662. [PMID: 39007649 PMCID: PMC11333172 DOI: 10.4049/jimmunol.2400131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 06/13/2024] [Indexed: 07/16/2024]
Abstract
The expressed Ab repertoire is a critical determinant of immune-related phenotypes. Ab-encoding transcripts are distinct from other expressed genes because they are transcribed from somatically rearranged gene segments. Human Abs are composed of two identical H and L chain polypeptides derived from genes in IGH locus and one of two L chain loci. The combinatorial diversity that results from Ab gene rearrangement and the pairing of different H and L chains contributes to the immense diversity of the baseline Ab repertoire. During rearrangement, Ab gene selection is mediated by factors that influence chromatin architecture, promoter/enhancer activity, and V(D)J recombination. Interindividual variation in the composition of the Ab repertoire associates with germline variation in IGH, implicating polymorphism in Ab gene regulation. Determining how IGH variants directly mediate gene regulation will require integration of these variants with other functional genomic datasets. In this study, we argue that standard approaches using short reads have limited utility for characterizing regulatory regions in IGH at haplotype resolution. Using simulated and chromatin immunoprecipitation sequencing reads, we define features of IGH that limit use of short reads and a single reference genome, namely 1) the highly duplicated nature of the DNA sequence in IGH and 2) structural polymorphisms that are frequent in the population. We demonstrate that personalized diploid references enhance performance of short-read data for characterizing mappable portions of the locus, while also showing that long-read profiling tools will ultimately be needed to fully resolve functional impacts of IGH germline variation on expressed Ab repertoires.
Collapse
Affiliation(s)
- Eric Engelbrecht
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| |
Collapse
|
7
|
Lestringant V, Guermouche-Flament H, Jimenez-Pocquet M, Gaillard JB, Penther D. Cytogenetics in the management of hematological malignancies: An overview of alternative technologies for cytogenetic characterization. Curr Res Transl Med 2024; 72:103440. [PMID: 38447270 DOI: 10.1016/j.retram.2024.103440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 12/22/2023] [Accepted: 01/11/2024] [Indexed: 03/08/2024]
Abstract
Genomic characterization is an essential part of the clinical management of hematological malignancies for diagnostic, prognostic and therapeutic purposes. Although CBA and FISH are still the gold standard in hematology for the detection of CNA and SV, some alternative technologies are intended to complement their deficiencies or even replace them in the more or less near future. In this article, we provide a technological overview of these alternatives. CMA is the historical and well established technique for the high-resolution detection of CNA. For SV detection, there are emerging techniques based on the study of chromatin conformation and more established ones such as RTMLPA for the detection of fusion transcripts and RNA-seq to reveal the molecular consequences of SV. Comprehensive techniques that detect both CNA and SV are the most interesting because they provide all the information in a single examination. Among these, OGM is a promising emerging higher-solution technique that offers a complete solution at a contained cost, at the expense of a relatively low throughput per machine. WGS remains the most adaptable solution, with long-read approaches enabling very high-resolution detection of CAs, but requiring a heavy bioinformatics installation and at a still high cost. However, the development of high-resolution genome-wide detection techniques for CAs allows for a much better description of chromoanagenesis. Therefore, we have included in this review an update on the various existing mechanisms and their consequences and implications, especially prognostic, in hematological malignancies.
Collapse
Affiliation(s)
| | | | | | - Jean-Baptiste Gaillard
- Unité de Génétique Chromosomique, Service de Génétique moléculaire et cytogénomique, CHU Montpellier, Montpellier, France
| | | |
Collapse
|
8
|
Lysenkova Wiklander M, Arvidsson G, Bunikis I, Lundmark A, Raine A, Marincevic-Zuniga Y, Gezelius H, Bremer A, Feuk L, Ameur A, Nordlund J. A multiomic characterization of the leukemia cell line REH using short- and long-read sequencing. Life Sci Alliance 2024; 7:e202302481. [PMID: 38777370 PMCID: PMC11111970 DOI: 10.26508/lsa.202302481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 05/02/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024] Open
Abstract
The B-cell acute lymphoblastic leukemia (ALL) cell line REH, with the t(12;21) ETV6::RUNX1 translocation, is known to have a complex karyotype defined by a series of large-scale chromosomal rearrangements. Taken from a 15-yr-old at relapse, the cell line offers a practical model for the study of pediatric B-ALL. In recent years, short- and long-read DNA and RNA sequencing have emerged as a complement to karyotyping techniques in the resolution of structural variants in an oncological context. Here, we explore the integration of long-read PacBio and Oxford Nanopore whole-genome sequencing, IsoSeq RNA sequencing, and short-read Illumina sequencing to create a detailed genomic and transcriptomic characterization of the REH cell line. Whole-genome sequencing clarified the molecular traits of disrupted ALL-associated genes including CDKN2A, PAX5, BTG1, VPREB1, and TBL1XR1, as well as the glucocorticoid receptor NR3C1 Meanwhile, transcriptome sequencing identified seven fusion genes within the genomic breakpoints. Together, our extensive whole-genome investigation makes high-quality open-source data available to the leukemia genomics community.
Collapse
Affiliation(s)
- Mariya Lysenkova Wiklander
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Gustav Arvidsson
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Ignas Bunikis
- SciLifeLab, Uppsala University, Uppsala, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Anders Lundmark
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Amanda Raine
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Yanara Marincevic-Zuniga
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Henrik Gezelius
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Anna Bremer
- SciLifeLab, Uppsala University, Uppsala, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- Department of Clinical Genetics, Uppsala University Hospital, Uppsala, Sweden
| | - Lars Feuk
- SciLifeLab, Uppsala University, Uppsala, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Adam Ameur
- SciLifeLab, Uppsala University, Uppsala, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| | - Jessica Nordlund
- Department of Medical Sciences, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
- National Genomics Infrastructure, Uppsala University, Uppsala, Sweden
| |
Collapse
|
9
|
Szakállas N, Barták BK, Valcz G, Nagy ZB, Takács I, Molnár B. Can long-read sequencing tackle the barriers, which the next-generation could not? A review. Pathol Oncol Res 2024; 30:1611676. [PMID: 38818014 PMCID: PMC11137202 DOI: 10.3389/pore.2024.1611676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 04/30/2024] [Indexed: 06/01/2024]
Abstract
The large-scale heterogeneity of genetic diseases necessitated the deeper examination of nucleotide sequence alterations enhancing the discovery of new targeted drug attack points. The appearance of new sequencing techniques was essential to get more interpretable genomic data. In contrast to the previous short-reads, longer lengths can provide a better insight into the potential health threatening genetic abnormalities. Long-reads offer more accurate variant identification and genome assembly methods, indicating advances in nucleotide deflect-related studies. In this review, we introduce the historical background of sequencing technologies and show their benefits and limits, as well. Furthermore, we highlight the differences between short- and long-read approaches, including their unique advances and difficulties in methodologies and evaluation. Additionally, we provide a detailed description of the corresponding bioinformatics and the current applications.
Collapse
Affiliation(s)
- Nikolett Szakállas
- Department of Biological Physics, Faculty of Science, Eötvös Loránd University, Budapest, Hungary
| | - Barbara K. Barták
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - Gábor Valcz
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
- HUN-REN-SU Translational Extracellular Vesicle Research Group, Budapest, Hungary
| | - Zsófia B. Nagy
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - István Takács
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - Béla Molnár
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| |
Collapse
|
10
|
Joe S, Park JL, Kim J, Kim S, Park JH, Yeo MK, Lee D, Yang JO, Kim SY. Comparison of structural variant callers for massive whole-genome sequence data. BMC Genomics 2024; 25:318. [PMID: 38549092 PMCID: PMC10976732 DOI: 10.1186/s12864-024-10239-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 03/18/2024] [Indexed: 04/01/2024] Open
Abstract
BACKGROUND Detecting structural variations (SVs) at the population level using next-generation sequencing (NGS) requires substantial computational resources and processing time. Here, we compared the performances of 11 SV callers: Delly, Manta, GridSS, Wham, Sniffles, Lumpy, SvABA, Canvas, CNVnator, MELT, and INSurVeyor. These SV callers have been recently published and have been widely employed for processing massive whole-genome sequencing datasets. We evaluated the accuracy, sequence depth, running time, and memory usage of the SV callers. RESULTS Notably, several callers exhibited better calling performance for deletions than for duplications, inversions, and insertions. Among the SV callers, Manta identified deletion SVs with better performance and efficient computing resources, and both Manta and MELT demonstrated relatively good precision regarding calling insertions. We confirmed that the copy number variation callers, Canvas and CNVnator, exhibited better performance in identifying long duplications as they employ the read-depth approach. Finally, we also verified the genotypes inferred from each SV caller using a phased long-read assembly dataset, and Manta showed the highest concordance in terms of the deletions and insertions. CONCLUSIONS Our findings provide a comprehensive understanding of the accuracy and computational efficiency of SV callers, thereby facilitating integrative analysis of SV profiles in diverse large-scale genomic datasets.
Collapse
Grants
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NRF-2020M3E5D708517212, 2020M3A9I6A0103605713 Ministry of Science and ICT, South Korea
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
- NTIS-1711170620 KRIBB Research Initiative Program
Collapse
Affiliation(s)
- Soobok Joe
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Jong-Lyul Park
- Aging Convergence Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Department of Functional Genomics, University of Science and Technology (UST), 34113, Daejeon, Republic of Korea
| | - Jun Kim
- Department of Convergent Bioscience and Informatics, College of Bioscience and Biotechnology, Chungnam National University, Daejeon, 34134, Republic of Korea
| | - Sangok Kim
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Ji-Hwan Park
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Department of Bioscience, University of Science and Technology (UST), Daejeon, 34113, Republic of Korea
| | - Min-Kyung Yeo
- Department of Pathology, Chungnam National University School of Medicine, Daejeon, 35015, Republic of Korea
| | - Dongyoon Lee
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Jin Ok Yang
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea.
| | - Seon-Young Kim
- Korea Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
- Department of Bioscience, University of Science and Technology (UST), Daejeon, 34113, Republic of Korea.
| |
Collapse
|
11
|
Timmaraju VA, Finkelstein SD, Levine JA. Analytical Validation of Loss of Heterozygosity and Mutation Detection in Pancreatic Fine-Needle Aspirates by Capillary Electrophoresis and Sanger Sequencing. Diagnostics (Basel) 2024; 14:514. [PMID: 38472986 DOI: 10.3390/diagnostics14050514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 02/15/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024] Open
Abstract
Pancreatic cystic disease, including duct dilation, represents precursor states towards the development of pancreatic cancer, a form of malignancy with relatively low incidence but high mortality. While most of these cysts (>85%) are benign, the remainder can progress over time, leading to malignant transformation, invasion, and metastasis. Cytologic diagnosis is challenging, limited by the paucity or complete absence of cells representative of cystic lesions and fibrosis. Molecular analysis of fluids collected from endoscopic-guided fine-needle aspiration of pancreatic cysts and dilated duct lesions can be used to evaluate the risk of progression to malignancy. The basis for the enhanced diagnostic utility of molecular approaches is the ability to interrogate cell-free nucleic acid of the cyst/duct and/or extracellular fluid. The allelic imbalances at tumor suppressor loci and the selective oncogenic drivers are used clinically to help differentiate benign stable pancreatic cysts from those progressing toward high-grade dysplasia. Methods are discussed and used to determine the efficacy for diagnostic implementation. Here, we report the analytical validation of methods to detect causally associated molecular changes integral to the pathogenesis of pancreatic cancer from pancreatic cyst fluids.
Collapse
|
12
|
Meng X, Wang M, Luo M, Sun L, Yan Q, Liu Y. Systematic evaluation of multiple NGS platforms for structural variants detection. J Biol Chem 2023; 299:105436. [PMID: 37944616 PMCID: PMC10724692 DOI: 10.1016/j.jbc.2023.105436] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 10/29/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023] Open
Abstract
Structural variations (SV) are critical genome changes affecting human diseases. Although many hybridization-based methods exist, evaluating SVs through next-generation sequencing (NGS) data is still necessary for broader research exploration. Here, we comprehensively compared the performance of 16 SV callers and multiple NGS platforms using NA12878 whole genome sequencing (WGS) datasets. The results indicated that several SV callers performed well relatively, such as Manta, GRIDSS, LUMPY, TARDIS, FermiKit, and Wham. Meanwhile, all NGS platforms have a similar performance using a single software. Additionally, we found that the source of undetected SVs was mostly from long reads datasets, therefore, the more appropriate strategy for accurate SV detection will be an integration of long and shorter reads in the future. At present, in the period of NGS as a mainstream method in bioinformatics, our study would provide helpful and comprehensive guidelines for specific categories of SV research.
Collapse
Affiliation(s)
- Xuan Meng
- School of Medicine, Southern University of Science and Technology, Shenzhen, China
| | - Miao Wang
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Mingjie Luo
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Lei Sun
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Qin Yan
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Yongfeng Liu
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China.
| |
Collapse
|
13
|
Yu Y. Emerging trends in clinical cancer genomic research. Cancer Biol Med 2023; 20:j.issn.2095-3941.2023.0383. [PMID: 38018030 PMCID: PMC10690877 DOI: 10.20892/j.issn.2095-3941.2023.0383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 11/07/2023] [Indexed: 11/30/2023] Open
Affiliation(s)
- Yingyan Yu
- Department of General Surgery of Ruijin Hospital, Shanghai Institute of Digestive Surgery, and Shanghai Key Laboratory for Gastric Neoplasms, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| |
Collapse
|
14
|
Bush ZD, Naftaly AFS, Dinwiddie D, Albers C, Hillers KJ, Libuda DE. Comprehensive detection of structural variation and transposable element differences between wild type laboratory lineages of C. elegans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.13.523974. [PMID: 37961628 PMCID: PMC10634987 DOI: 10.1101/2023.01.13.523974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Genomic structural variations (SVs) and transposable elements (TEs) can be significant contributors to genome evolution, altered gene expression, and risk of genetic diseases. Recent advancements in long-read sequencing have greatly improved the quality of de novo genome assemblies and enhanced the detection of sequence variants at the scale of hundreds or thousands of bases. Comparisons between two diverged wild isolates of Caenorhabditis elegans, the Bristol and Hawaiian strains, have been widely utilized in the analysis of small genetic variations. Genetic drift, including SVs and rearrangements of repeated sequences such as TEs, can occur over time from long-term maintenance of wild type isolates within the laboratory. To comprehensively detect both large and small structural variations as well as TEs due to genetic drift, we generated de novo genome assemblies and annotations for each strain from our lab collection using both long- and short-read sequencing and compared our assemblies and annotations with that of other lab wild type strains. Within our lab assemblies, we annotate over 3.1Mb of sequence divergence between the Bristol and Hawaiian isolates: 337,584 SNPs, 94,503 small insertion-deletions (<50bp), and 4,334 structural variations (>50bp). Further, we define the location and movement of specific DNA TEs between N2 Bristol and CB4856 Hawaiian wild type isolates. Specifically, we find the N2 Bristol genome has 20.6% more TEs from the Tc1/mariner family than the CB4856 Hawaiian genome. Moreover, we identified Zator elements as the most abundant and mobile TE family in the genome. Using specific TE sequences with unique SNPs, we also identify 38 TEs that moved intrachromosomally and 9 TEs that moved interchromosomally between the N2 Bristol and CB4856 Hawaiian genomes. By comparing the de novo genome assembly of our lab collection Bristol isolate to the VC2010 Bristol assembly, we also reveal that lab lineages display over 2 Mb of total variation: 1,162 SNPs, 1,528 indels, and 897 SVs with 95% of the variation due to SVs. Overall, our work demonstrates the unique contribution of SVs and TEs to variation and genetic drift between wild type laboratory strains assumed to be isogenic despite growing evidence of genetic drift and phenotypic variation.
Collapse
Affiliation(s)
- Zachary D. Bush
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Alice F. S. Naftaly
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Devin Dinwiddie
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Cora Albers
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| | - Kenneth J. Hillers
- Biological Sciences Department, California Polytechnic State University, San Luis Obispo, California, USA
| | - Diana E. Libuda
- Institute of Molecular Biology, Department of Biology, University of Oregon, 1229 Franklin Blvd Eugene, OR 97403, USA
| |
Collapse
|
15
|
Hansen MH, Cédile O, Kjeldsen MLG, Thomassen M, Preiss B, von Neuhoff N, Abildgaard N, Nyvold CG. Toward Cytogenomics: Technical Assessment of Long-Read Nanopore Whole-Genome Sequencing for Detecting Large Chromosomal Alterations in Mantle Cell Lymphoma. J Mol Diagn 2023; 25:796-805. [PMID: 37683892 DOI: 10.1016/j.jmoldx.2023.08.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 06/20/2023] [Accepted: 08/14/2023] [Indexed: 09/10/2023] Open
Abstract
The current advances and success of next-generation sequencing hold the potential for the transition of cancer cytogenetics toward comprehensive cytogenomics. However, the conventional use of short reads impedes the resolution of chromosomal aberrations. Thus, this study evaluated the detection and reproducibility of extensive copy number alterations and chromosomal translocations using long-read Oxford Nanopore Technologies whole-genome sequencing compared with short-read Illumina sequencing. Using the mantle cell lymphoma cell line Granta-519, almost 99% copy-number reproducibility at the 100-kilobase resolution between replicates was demonstrated, with 98% concordance to Illumina. Collectively, the performance of copy number calling from 1.5 million to 7.5 million long reads was comparable to 1 billion Illumina-based reads (50× coverage). Expectedly, the long-read resolution of canonical translocation t(11;14)(q13;q32) was superior, with a sequence similarity of 89% to the already published CCND1/IGH junction (9× coverage), spanning up to 69 kilobases. The cytogenetic profile of Granta-519 was in general agreement with the literature and karyotype, although several differences remained unresolved. In conclusion, contemporary long-read sequencing is primed for future cytogenomics or sequencing-guided cytogenetics. The combined strength of long- and short-read sequencing is apparent, where the high-precision junctional mapping complements and splits paired-end reads. The potential is emphasized by the flexible single-sample genomic data acquisition of Oxford Nanopore Technologies with the high resolution of allelic imbalances using Illumina short-read sequencing.
Collapse
Affiliation(s)
- Marcus H Hansen
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark.
| | - Oriane Cédile
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark; OPEN, Odense Patient Data Explorative Network, Odense University Hospital, Odense, Denmark
| | - Marie L G Kjeldsen
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
| | - Mads Thomassen
- Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
| | - Birgitte Preiss
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Pathology, Odense University Hospital, Odense, Denmark
| | - Nils von Neuhoff
- Department of Pediatric Hematology and Oncology, Essen University Hospital and University of Duisburg-Essen, Essen, Germany
| | - Niels Abildgaard
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark
| | - Charlotte G Nyvold
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark; OPEN, Odense Patient Data Explorative Network, Odense University Hospital, Odense, Denmark
| |
Collapse
|
16
|
Romagnoli S, Bartalucci N, Vannucchi AM. Resolving complex structural variants via nanopore sequencing. Front Genet 2023; 14:1213917. [PMID: 37674481 PMCID: PMC10479017 DOI: 10.3389/fgene.2023.1213917] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 07/26/2023] [Indexed: 09/08/2023] Open
Abstract
The recent development of high-throughput sequencing platforms provided impressive insights into the field of human genetics and contributed to considering structural variants (SVs) as the hallmark of genome instability, leading to the establishment of several pathologic conditions, including neoplasia and neurodegenerative and cognitive disorders. While SV detection is addressed by next-generation sequencing (NGS) technologies, the introduction of more recent long-read sequencing technologies have already been proven to be invaluable in overcoming the inaccuracy and limitations of NGS technologies when applied to resolve wide and structurally complex SVs due to the short length (100-500 bp) of the sequencing read utilized. Among the long-read sequencing technologies, Oxford Nanopore Technologies developed a sequencing platform based on a protein nanopore that allows the sequencing of "native" long DNA molecules of virtually unlimited length (typical range 1-100 Kb). In this review, we focus on the bioinformatics methods that improve the identification and genotyping of known and novel SVs to investigate human pathological conditions, discussing the possibility of introducing nanopore sequencing technology into routine diagnostics.
Collapse
Affiliation(s)
| | | | - Alessandro Maria Vannucchi
- CRIMM, Center of Research and Innovation of Myeloproliferative Neoplasms, DENOTHE Excellence Center, Careggi University Hospital and Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy
| |
Collapse
|
17
|
Wong J, Coombe L, Nikolić V, Zhang E, Nip KM, Sidhu P, Warren RL, Birol I. Linear time complexity de novo long read genome assembly with GoldRush. Nat Commun 2023; 14:2906. [PMID: 37217507 PMCID: PMC10202940 DOI: 10.1038/s41467-023-38716-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 05/11/2023] [Indexed: 05/24/2023] Open
Abstract
Current state-of-the-art de novo long read genome assemblers follow the Overlap-Layout-Consensus paradigm. While read-to-read overlap - its most costly step - was improved in modern long read genome assemblers, these tools still often require excessive RAM when assembling a typical human dataset. Our work departs from this paradigm, foregoing all-vs-all sequence alignments in favor of a dynamic data structure implemented in GoldRush, a de novo long read genome assembly algorithm with linear time complexity. We tested GoldRush on Oxford Nanopore Technologies long sequencing read datasets with different base error profiles sourced from three human cell lines, rice, and tomato. Here, we show that GoldRush achieves assembly scaffold NGA50 lengths of 18.3-22.2, 0.3 and 2.6 Mbp, for the genomes of human, rice, and tomato, respectively, and assembles each genome within a day, using at most 54.5 GB of random-access memory, demonstrating the scalability of our genome assembly paradigm and its implementation.
Collapse
Affiliation(s)
- Johnathan Wong
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada.
| | - Lauren Coombe
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - Vladimir Nikolić
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - Emily Zhang
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - Ka Ming Nip
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - Puneet Sidhu
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - René L Warren
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada
| | - Inanç Birol
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC, V5Z 4S6, Canada.
| |
Collapse
|
18
|
Rausch T, Snajder R, Leger A, Simovic M, Giurgiu M, Villacorta L, Henssen AG, Fröhling S, Stegle O, Birney E, Bonder MJ, Ernst A, Korbel JO. Long-read sequencing of diagnosis and post-therapy medulloblastoma reveals complex rearrangement patterns and epigenetic signatures. CELL GENOMICS 2023; 3:100281. [PMID: 37082141 PMCID: PMC10112291 DOI: 10.1016/j.xgen.2023.100281] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 06/14/2022] [Accepted: 02/22/2023] [Indexed: 04/22/2023]
Abstract
Cancer genomes harbor a broad spectrum of structural variants (SVs) driving tumorigenesis, a relevant subset of which escape discovery using short-read sequencing. We employed Oxford Nanopore Technologies (ONT) long-read sequencing in a paired diagnostic and post-therapy medulloblastoma to unravel the haplotype-resolved somatic genetic and epigenetic landscape. We assembled complex rearrangements, including a 1.55-Mbp chromothripsis event, and we uncover a complex SV pattern termed templated insertion (TI) thread, characterized by short (mostly <1 kb) insertions showing prevalent self-concatenation into highly amplified structures of up to 50 kbp in size. TI threads occur in 3% of cancers, with a prevalence up to 74% in liposarcoma, and frequent colocalization with chromothripsis. We also perform long-read-based methylome profiling and discover allele-specific methylation (ASM) effects, complex rearrangements exhibiting differential methylation, and differential promoter methylation in cancer-driver genes. Our study shows the advantage of long-read sequencing in the discovery and characterization of complex somatic rearrangements.
Collapse
Affiliation(s)
- Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), GeneCore, Heidelberg, Germany
| | - Rene Snajder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty for Biosciences, Heidelberg University, Heidelberg, Germany
- HIDSS4Health, Helmholtz Information and Data Science School for Health, Heidelberg, Germany
| | - Adrien Leger
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Milena Simovic
- Group “Genome Instability in Tumors,” German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Mădălina Giurgiu
- Experimental and Clinical Research Center (ECRC) of the Max Delbrück Center (MDC) and Charité-Universitätsmedizin, Berlin, Germany
- Freie Universität Berlin, Berlin, Germany
| | - Laura Villacorta
- European Molecular Biology Laboratory (EMBL), GeneCore, Heidelberg, Germany
| | - Anton G. Henssen
- Department of Pediatric Oncology/Hematology, Charité-Universitätsmedizin, Berlin, Germany
- Experimental and Clinical Research Center (ECRC) of the Max Delbrück Center (MDC) and Charité-Universitätsmedizin, Berlin, Germany
- German Cancer Consortium (DKTK), partner site Berlin, and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefan Fröhling
- National Center for Tumor Diseases (NCT), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Oliver Stegle
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Ewan Birney
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Marc Jan Bonder
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Aurelie Ernst
- Group “Genome Instability in Tumors,” German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jan O. Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Bridging Research Division on Mechanisms of Genomic Variation and Data Science, DKFZ, Heidelberg, Germany
| |
Collapse
|
19
|
Abstract
Nanopore sensing is a disruptive, revolutionary way in which to sequence nucleic acids, including both native DNA and RNA molecules. First commercialized with the MinIONTM sequencer from Oxford Nanopore TechnologiesTM in 2015, this review article looks at the current state of nanopore sequencing as of June 2022. Covering the unique characteristics of the technology and how it functions, we then go on to look at the ability of the platform to deliver sequencing at all scales-from personal to high-throughput devices-before looking at how the scientific community is applying the technology around the world to answer their biological questions.
Collapse
|
20
|
Long-Read Whole-Genome Sequencing Using a Nanopore Sequencer and Detection of Structural Variants in Cancer Genomes. Methods Mol Biol 2023; 2632:177-189. [PMID: 36781729 DOI: 10.1007/978-1-0716-2996-3_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Long-read sequencing technologies enable us to precisely identify structural variants (SVs), which would be occasionally associated with various types of diseases, including cancers. In this section, we introduce experimental and computational procedures for conducting long-read whole-genome sequencing (WGS) of cancer genomes from fresh frozen tissues/cells. We also demonstrate the analysis of SVs in cancer genomes using long-read WGS data from lung cancer cell lines by several representative computational tools, such as cuteSV and Sniffles2, as examples.
Collapse
|
21
|
Bonenfant Q, Noé L, Touzet H. Porechop_ABI: discovering unknown adapters in Oxford Nanopore Technology sequencing reads for downstream trimming. BIOINFORMATICS ADVANCES 2022; 3:vbac085. [PMID: 36698762 PMCID: PMC9869717 DOI: 10.1093/bioadv/vbac085] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/07/2022] [Indexed: 11/23/2022]
Abstract
Motivation Oxford Nanopore Technologies (ONT) sequencing has become very popular over the past few years and offers a cost-effective solution for many genomic and transcriptomic projects. One distinctive feature of the technology is that the protocol includes the ligation of adapters to both ends of each fragment. Those adapters should then be removed before downstream analyses, either during the basecalling step or by explicit trimming. This basic task may be tricky when the definition of the adapter sequence is not well documented. Results We have developed a new method to scan a set of ONT reads to see if it contains adapters, without any prior knowledge on the sequence of the potential adapters, and then trim out those adapters. The algorithm is based on approximate k-mers and is able to discover adapter sequences based on their frequency alone. The method was successfully tested on a variety of ONT datasets with different flowcells, sequencing kits and basecallers. Availability and implementation The resulting software, named Porechop_ABI, is open-source and is available at https://github.com/bonsai-team/Porechop_ABI. Supplementary information Supplementary data are available at Bioinformatics advances online.
Collapse
Affiliation(s)
- Quentin Bonenfant
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL—Centre de Recherche en Informatique Signal et Automatique de Lille, Lille F-59000, France
| | - Laurent Noé
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL—Centre de Recherche en Informatique Signal et Automatique de Lille, Lille F-59000, France
| | | |
Collapse
|
22
|
Gindin T, Hsiao SJ. Analytical Principles of Cancer Next Generation Sequencing. Clin Lab Med 2022; 42:395-408. [DOI: 10.1016/j.cll.2022.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
23
|
Leipart V, Ludvigsen J, Kent M, Sandve S, To T, Árnyasi M, Kreibich CD, Dahle B, Amdam GV. Identification of 121 variants of honey bee Vitellogenin protein sequences with structural differences at functional sites. Protein Sci 2022; 31:e4369. [PMID: 35762708 PMCID: PMC9207902 DOI: 10.1002/pro.4369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 05/21/2022] [Indexed: 12/04/2022]
Abstract
Proteins are under selection to maintain central functions and to accommodate needs that arise in ever-changing environments. The positive selection and neutral drift that preserve functions result in a diversity of protein variants. The amount of diversity differs between proteins: multifunctional or disease-related proteins tend to have fewer variants than proteins involved in some aspects of immunity. Our work focuses on the extensively studied protein Vitellogenin (Vg), which in honey bees (Apis mellifera) is multifunctional and highly expressed and plays roles in immunity. Yet, almost nothing is known about the natural variation in the coding sequences of this protein or how amino acid-altering variants might impact structure-function relationships. Here, we map out allelic variation in honey bee Vg using biological samples from 15 countries. The successful barcoded amplicon Nanopore sequencing of 543 bees revealed 121 protein variants, indicating a high level of diversity in Vg. We find that the distribution of non-synonymous single nucleotide polymorphisms (nsSNPs) differs between protein regions with different functions; domains involved in DNA and protein-protein interactions contain fewer nsSNPs than the protein's lipid binding cavities. We outline how the central functions of the protein can be maintained in different variants and how the variation pattern may inform about selection from pathogens and nutrition.
Collapse
Affiliation(s)
- Vilde Leipart
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesÅsNorway
| | - Jane Ludvigsen
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesÅsNorway
- Fürst Medisinsk LaboratoriumOsloNorway
| | - Matthew Kent
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE)Norwegian University of Life SciencesÅsNorway
| | - Simen Sandve
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE)Norwegian University of Life SciencesÅsNorway
| | - Thu‐Hien To
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE)Norwegian University of Life SciencesÅsNorway
| | - Mariann Árnyasi
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE)Norwegian University of Life SciencesÅsNorway
| | - Claus D. Kreibich
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesÅsNorway
| | - Bjørn Dahle
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesÅsNorway
- Norwegian Beekeepers AssociationKløftaNorway
| | - Gro V. Amdam
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesÅsNorway
- School of Life SciencesArizona State UniversityTempeArizonaUSA
| |
Collapse
|
24
|
Hamdan A, Ewing A. Unravelling the tumour genome: The evolutionary and clinical impacts of structural variants in tumourigenesis. J Pathol 2022; 257:479-493. [PMID: 35355264 PMCID: PMC9321913 DOI: 10.1002/path.5901] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 03/16/2022] [Accepted: 03/28/2022] [Indexed: 11/15/2022]
Abstract
Structural variants (SVs) represent a major source of aberration in tumour genomes. Given the diversity in the size and type of SVs present in tumours, the accurate detection and interpretation of SVs in tumours is challenging. New classes of complex structural events in tumours are discovered frequently, and the definitions of the genomic consequences of complex events are constantly being refined. Detailed analyses of short-read whole-genome sequencing (WGS) data from large tumour cohorts facilitate the interrogation of SVs at orders of magnitude greater scale and depth. However, the inherent technical limitations of short-read WGS prevent us from accurately detecting and investigating the impact of all the SVs present in tumours. The expanded use of long-read WGS will be critical for improving the accuracy of SV detection, and in fully resolving complex SV events, both of which are crucial for determining the impact of SVs on tumour progression and clinical outcome. Despite the present limitations, we demonstrate that SVs play an important role in tumourigenesis. In particular, SVs contribute significantly to late-stage tumour development and to intratumoural heterogeneity. The evolutionary trajectories of SVs represent a window into the clonal dynamics in tumours, a comprehensive understanding of which will be vital for influencing patient outcomes in the future. Recent findings have highlighted many clinical applications of SVs in cancer, from early detection to biomarkers for treatment response and prognosis. As the methods to detect and interpret SVs improve, elucidating the full breadth of the complex SV landscape and determining how these events modulate tumour evolution will improve our understanding of cancer biology and our ability to capitalise on the utility of SVs in the clinical management of cancer patients. © 2022 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of The Pathological Society of Great Britain and Ireland.
Collapse
Affiliation(s)
- Alhafidz Hamdan
- MRC Human Genetics Unit, Institute of Genetics and CancerUniversity of EdinburghEdinburghUK
- Cancer Research UK Edinburgh Centre, Institute of Genetics and CancerUniversity of EdinburghEdinburghUK
| | - Ailith Ewing
- MRC Human Genetics Unit, Institute of Genetics and CancerUniversity of EdinburghEdinburghUK
- Cancer Research UK Edinburgh Centre, Institute of Genetics and CancerUniversity of EdinburghEdinburghUK
| |
Collapse
|
25
|
Sakamoto Y, Miyake S, Oka M, Kanai A, Kawai Y, Nagasawa S, Shiraishi Y, Tokunaga K, Kohno T, Seki M, Suzuki Y, Suzuki A. Phasing analysis of lung cancer genomes using a long read sequencer. Nat Commun 2022; 13:3464. [PMID: 35710642 PMCID: PMC9203510 DOI: 10.1038/s41467-022-31133-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 06/02/2022] [Indexed: 12/14/2022] Open
Abstract
Chromosomal backgrounds of cancerous mutations still remain elusive. Here, we conduct the phasing analysis of non-small cell lung cancer specimens of 20 Japanese patients. By the combinatory use of short and long read sequencing data, we obtain long phased blocks of 834 kb in N50 length with >99% concordance rate. By analyzing the obtained phasing information, we reveal that several cancer genomes harbor regions in which mutations are unevenly distributed to either of two haplotypes. Large-scale chromosomal rearrangement events, which resemble chromothripsis events but have smaller scales, occur on only one chromosome, and these events account for the observed biased distributions. Interestingly, the events are characteristic of EGFR mutation-positive lung adenocarcinomas. Further integration of long read epigenomic and transcriptomic data reveal that haploid chromosomes are not always at equivalent transcriptomic/epigenomic conditions. Distinct chromosomal backgrounds are responsible for later cancerous aberrations in a haplotype-specific manner.
Collapse
Affiliation(s)
- Yoshitaka Sakamoto
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Shuhei Miyake
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Miho Oka
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
- Ono Pharmaceutical Co., Ltd, Ibaraki, Japan
| | - Akinori Kanai
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Yosuke Kawai
- Genome Medical Science Project (Toyama), National Center for Global Health and Medicine, Tokyo, Japan
| | - Satoi Nagasawa
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Yuichi Shiraishi
- Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Tokyo, Japan
| | - Katsushi Tokunaga
- Genome Medical Science Project (Toyama), National Center for Global Health and Medicine, Tokyo, Japan
| | - Takashi Kohno
- Division of Genome Biology, National Cancer Center Research Institute, Tokyo, Japan
| | - Masahide Seki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Yutaka Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
| | - Ayako Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
| |
Collapse
|
26
|
Quan C, Lu H, Lu Y, Zhou G. Population-scale genotyping of structural variation in the era of long-read sequencing. Comput Struct Biotechnol J 2022; 20:2639-2647. [PMID: 35685364 PMCID: PMC9163579 DOI: 10.1016/j.csbj.2022.05.047] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/24/2022] [Accepted: 05/24/2022] [Indexed: 11/29/2022] Open
Abstract
Population-scale studies of structural variation (SV) are growing rapidly worldwide with the development of long-read sequencing technology, yielding a considerable number of novel SVs and complete gap-closed genome assemblies. Herein, we highlight recent studies using a hybrid sequencing strategy and present the challenges toward large-scale genotyping for SVs due to the reference bias. Genotyping SVs at a population scale remains challenging, which severely impacts genotype-based population genetic studies or genome-wide association studies of complex diseases. We summarize academic efforts to improve genotype quality through linear or graph representations of reference and alternative alleles. Graph-based genotypers capable of integrating diverse genetic information are effectively applied to large and diverse cohorts, contributing to unbiased downstream analysis. Meanwhile, there is still an urgent need in this field for efficient tools to construct complex graphs and perform sequence-to-graph alignments.
Collapse
Affiliation(s)
- Cheng Quan
- Department of Genetics & Integrative Omics, State Key Laboratory of Proteomics, National Center for Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 100850, PR China
| | - Hao Lu
- Department of Genetics & Integrative Omics, State Key Laboratory of Proteomics, National Center for Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 100850, PR China
| | - Yiming Lu
- Department of Genetics & Integrative Omics, State Key Laboratory of Proteomics, National Center for Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 100850, PR China
- Hebei University, Baoding, Hebei Province 071002, PR China
| | - Gangqiao Zhou
- Department of Genetics & Integrative Omics, State Key Laboratory of Proteomics, National Center for Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 100850, PR China
- Collaborative Innovation Center for Personalized Cancer Medicine, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu Province 211166, PR China
- Medical College of Guizhou University, Guiyang, Guizhou Province 550025, PR China
- Hebei University, Baoding, Hebei Province 071002, PR China
| |
Collapse
|
27
|
Marwaha S, Knowles JW, Ashley EA. A guide for the diagnosis of rare and undiagnosed disease: beyond the exome. Genome Med 2022; 14:23. [PMID: 35220969 PMCID: PMC8883622 DOI: 10.1186/s13073-022-01026-w] [Citation(s) in RCA: 173] [Impact Index Per Article: 57.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 02/10/2022] [Indexed: 02/07/2023] Open
Abstract
Rare diseases affect 30 million people in the USA and more than 300-400 million worldwide, often causing chronic illness, disability, and premature death. Traditional diagnostic techniques rely heavily on heuristic approaches, coupling clinical experience from prior rare disease presentations with the medical literature. A large number of rare disease patients remain undiagnosed for years and many even die without an accurate diagnosis. In recent years, gene panels, microarrays, and exome sequencing have helped to identify the molecular cause of such rare and undiagnosed diseases. These technologies have allowed diagnoses for a sizable proportion (25-35%) of undiagnosed patients, often with actionable findings. However, a large proportion of these patients remain undiagnosed. In this review, we focus on technologies that can be adopted if exome sequencing is unrevealing. We discuss the benefits of sequencing the whole genome and the additional benefit that may be offered by long-read technology, pan-genome reference, transcriptomics, metabolomics, proteomics, and methyl profiling. We highlight computational methods to help identify regionally distant patients with similar phenotypes or similar genetic mutations. Finally, we describe approaches to automate and accelerate genomic analysis. The strategies discussed here are intended to serve as a guide for clinicians and researchers in the next steps when encountering patients with non-diagnostic exomes.
Collapse
Affiliation(s)
- Shruti Marwaha
- Department of Medicine, Division of Cardiovascular Medicine, School of Medicine, Stanford University, Stanford, CA, USA.
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, USA.
| | - Joshua W Knowles
- Department of Medicine, Division of Cardiovascular Medicine, School of Medicine, Stanford University, Stanford, CA, USA
- Department of Medicine, Diabetes Research Center, Cardiovascular Institute and Prevention Research Center, Stanford, CA, USA
| | - Euan A Ashley
- Department of Medicine, Division of Cardiovascular Medicine, School of Medicine, Stanford University, Stanford, CA, USA.
- Stanford Center for Undiagnosed Diseases, Stanford University, Stanford, CA, USA.
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA.
| |
Collapse
|
28
|
Transposable Elements and Human Diseases: Mechanisms and Implication in the Response to Environmental Pollutants. Int J Mol Sci 2022; 23:ijms23052551. [PMID: 35269693 PMCID: PMC8910135 DOI: 10.3390/ijms23052551] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 02/21/2022] [Accepted: 02/22/2022] [Indexed: 02/06/2023] Open
Abstract
Transposable elements (TEs) are recognized as major players in genome plasticity and evolution. The high abundance of TEs in the human genome, especially the Alu and Long Interspersed Nuclear Element-1 (LINE-1) repeats, makes them responsible for the molecular origin of several diseases. This involves several molecular mechanisms that are presented in this review: insertional mutation, DNA recombination and chromosomal rearrangements, modification of gene expression, as well as alteration of epigenetic regulations. This literature review also presents some of the more recent and/or more classical examples of human diseases in which TEs are involved. Whether through insertion of LINE-1 or Alu elements that cause chromosomal rearrangements, or through epigenetic modifications, TEs are widely implicated in the origin of human cancers. Many other human diseases can have a molecular origin in TE-mediated chromosomal recombination or alteration of gene structure and/or expression. These diseases are very diverse and include hemoglobinopathies, metabolic and neurological diseases, and common diseases. Moreover, TEs can also have an impact on aging. Finally, the exposure of individuals to stresses and environmental contaminants seems to have a non-negligible impact on the epigenetic derepression and mobility of TEs, which can lead to the development of diseases. Thus, improving our knowledge of TEs may lead to new potential diagnostic markers of diseases.
Collapse
|
29
|
The Tumor Dynamism Is the Dark Matter of the NGS Galaxy: How to Understand It? Cancers (Basel) 2021; 13:cancers13215476. [PMID: 34771638 PMCID: PMC8582436 DOI: 10.3390/cancers13215476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 10/26/2021] [Accepted: 10/29/2021] [Indexed: 11/30/2022] Open
|